Osteoprotegerin

ABSTRACT

The present invention discloses a novel secreted polypeptide, termed Osteoprotegerin, which is a member of the tumor necrosis factor receptor superfamily and is involved in the regulation of bone metabolism. Also disclosed are nucleic acids encoding Osteoprotegerin, polypeptides, recombinant vectors and host cells for expression, antibodies which bind Osteoprotegerin, and pharmaceutical compositions. The polypeptides are used to treat bone diseases characterized by increased resorption such as osteoporosis.

This application is a division of U.S. Ser. No. 08/577,788, filed Dec. 22, 1995.

FIELD OF THE INVENTION

The invention relates generally to polypeptides involved in the regulation of bone metabolism. More particularly, the invention relates to a novel polypeptide, termed osteoprotegerin, which is a member of the tumor necrosis factor receptor superfamily. The polypeptide is used to treat bone diseases characterized by increased bone loss such as osteoporosis.

BACKGROUND OF THE INVENTION

Polypeptide growth factors and cytokines are secreted factors which signal a wide variety of changes in cell growth, differentiation, and metabolism, by specifically binding to discrete, surface bound receptors. As a class of proteins, receptors vary in their structure and mode of signal transduction. They are characterized by having an extracellular domain that is involved in ligand binding, and a cytoplasmic domain which transmits an appropriate intracellular signal. Receptor expression patterns ultimately determine which cells will respond to a given ligand, while the structure of a given receptor dictates the cellular response induced by ligand binding. Receptors have been shown to transmit intracellular signals via their cytoplasmic domains by activating protein tyrosine or protein serine/threonine phosphorylation (e.g., platelet derived growth factor receptor (PDGFR) or transforming growth factor-β receptor-I (TGFβR-I)), by stimulating G-protein activation (e.g., β-adrenergic receptor), and by modulating associations with cytoplasmic signal transducing proteins (e.g., TNFR-1 and Fas/APO) (Heldin, Cell 80, 213-223 (1995)).

The tumor necrosis factor receptor (TNFR) superfamily is a group of type I transmembrane proteins which share a conserved cysteine-rich motif which is repeated three to six times in the extracellular domain (Smith, et al. Cell 76, 953-962 (1994)). Collectively, these repeat units form the ligand binding domains of these receptors (Chen et al., Chemistry 270, 2874-2878 (1995)). The ligands for these receptors are a structurally related group of proteins homologous to TNFα (Goeddel et al. Cold Spring Harbor Symp. Quart. Biol. 51, 597-609 (1986); Nagata et al. Science 267, 1449-1456 (1995)). TNFα binds to distinct, but closely related receptors, TNFR-1 and TNFR-2. TNFα produces a variety of biological responses in receptor bearing cells, including proliferation, differentiation, cytotoxicity and apoptosis (Beutler et al. Ann. Rev. Biochem. 57, 505-518 (1988)).

TNFα is believed to mediate acute and chronic inflammatory responses (Beutler et al. Ann. Rev. Biochem. 57, 505-518 (1988)). Systemic delivery of TNFα induces toxic shock and widespread tissue necrosis. Because of this, TNFα may be responsible for the severe morbidity and mortality associated with a variety of infectious diseases, including sepsis. Mutations in FasL, the ligand for the TNFR-related receptor Fas/APO (Suda et al. Cell 75, 1169-1178 (1993)), are associated with autoimmunity (Fisher et al. Cell 81, 935-946 (1995)), while overproduction of FasL may be implicated in drug-induced hepatitis. Thus, ligands to the various TNFR-related proteins often mediate the serious effects of many disease states, which suggests that agents that neutralize the activity of these ligands would have therapeutic value. Soluble TNFR-1 receptors, and antibodies that bind TNFα, have been tested for their ability to neutralize systemic TNFα (Loetscher et al. Cancer Cells 3(6), 221-226 (1991)). A naturally occurring form of a secreted TNFR-1 mRNA was recently cloned, and its product tested for its ability to neutralize TNFα activity in vitro and in vivo (Kohno et al. PNAS USA 87, 8331-8335 (1990)). The ability of this protein to neutralize TNFα suggests that soluble TNFα receptors function to bind and clear TNF, thereby blocking the cytotoxic effects on TNFR-bearing cells.

An object of the invention is to identify new members of the TNFR superfamily. It is anticipated that new family members may be transmembrane proteins or soluble forms thereof comprising extracellular domains and lacking transmembrane and cytoplasmic domains. We have identified a new member of the TNFR superfamily which encodes a secreted protein that is closely related to TNFR-2. By analogy to soluble TNFR-1, the TNFR-2 related protein may negatively regulate the activity of its ligand, and thus may be useful in the treatment of certain human diseases.

SUMMARY OF THE INVENTION

A novel member of the tumor necrosis factor receptor (TNFR) superfamily has been identified from a fetal rat intestinal cDNA library. A full-length cDNA clone was obtained and sequenced. Expression of the rat cDNA in a transgenic mouse revealed a marked increase in bone density, particularly in long bones, pelvic bone and vertebrae. The polypeptide encoded by the cDNA is termed osteoprotegerin and plays a role in promoting bone accumulation.

The invention provides for nucleic acids encoding a polypeptide having at least one of the biological activities of osteoprotegerin. Nucleic acids which hybridize to nucleic acids encoding mouse, rat or human Osteoprotegerin as shown in FIGS. 2B, 9A and 9B are also provided. Preferably, osteoprotegerin is mammalian osteoprotegerin and more preferably is human osteoprotegerin. Recombinant vectors and host cells expressing osteoprotegerin are also encompassed as are methods of producing recombinant osteoprotegerin. Antibodies or fragments thereof which specifically bind the polypeptide are also disclosed.

Methods of treating bone diseases are also provided by the invention. The polypeptides are useful for preventing bone resorption and may be used to treat any condition resulting in bone loss such as osteoporosis, hypercalcemia, Paget's disease of bone, and bone loss due to rheumatoid arthritis or osteomyelitis, and the like. Bone diseases may also be treated with anti-sense or gene therapy using nucleic acids of the invention. Pharmaceutical compositions comprising osteoprotegerin nucleic acids and polypeptides are also encompassed.

DESCRIPTION OF THE FIGURES

FIGS. 1A-1C. A. FASTA analysis of novel EST LORF. Shown is the deduced FRI-1 amino acid sequence aligned to the human TNFR-2 sequence. B. Profile analysis of the novel EST LORF. Shown is the deduced FRI-1 amino acid sequence aligned to the TNFR-profile. C. Structural view of TNFR superfamily indicating region which is homologous to the novel FRI-1.

FIGS. 2A-2C. Structure and sequence of full length rat Osteoprotegerin gene, a novel member of the TNFR superfamily. A. Map of pMOB-B1.1 insert. Box indicates position of LORF within the cDNA sequence (bold line). Black box indicates signal peptide, and gray ellipses indicate position of cysteine-rich repeat sequences. B. Nucleic acid and protein sequence of the Rat Osteoprotegerin cDNA. The predicted signal peptide is underlined, and potential sites of N-linked glycosylation are indicated in bold, underlined letters. C. Pileup sequence comparison (Wisconsin GCG Package, Version 8.1) of Osteoprotegerin with other members of the TNFR superfamily.

FIG. 3. PepPlot analysis (Wisconsin GCG Package, Version 8.1) of the predicted rat Osteoprotegerin protein sequence.

FIG. 4. mRNA expression patterns for the Osteoprotegerin cDNA in human tissues. Northern blots were probed with a 32P-labeled rat cDNA insert (left two panels), or with the human cDNA insert (right panel).

FIG. 5. Creation of transgenic mice expressing the Osteoprotegerin cDNA in hepatocytes. Northern blot expression of HE-Osteoprotegerin transgene in mouse liver.

FIGS. 6A-6B. Increase in bone density in Osteoprotegerin transgenic mice. Panel A. Control mice. Panel B. Osteoprotegerin-expressing mice. At necropsy, all animals were radiographed and photographs prepared. In A, the radiographs of the control animals and the one transgenic non-expressor (#28) are shown. Note that the bones have a clearly defined cortex and a lucent central marrow cavity. In contrast, the Osteoprotegerin (B) animals have a poorly defined cortex and increased density in the marrow zone.

FIGS. 7A-7B. Increase in trabecular bone in Osteoprotegerin transgenic mice. Plate A. Representative photomicrographs of bones from control animals. In A and B, low (4×, 10×) power images of the femurs are shown (Masson Trichrome stain). Stains for tartrate resistant acid phosphatase (TRAP) demonstrate osteoclasts (see arrows) both resorbing cartilage (C) and trabecular bone (D). Note the flattened appearance of osteoclasts on trabecular bone. Plate B. Representative photomicrographs of bones from Osteoprotegerin-expressing animals. In A and B, low (4×, 10×) power images of the femurs are shown (Masson Trichrome stain). The clear region is the growth plate cartilage, blue stained area is bone, and the red area is marrow. Note that in contrast to the controls, the trabecular bone has not been resorbed resulting in the absence of the usual marrow cavity. Also, the resulting trabeculae have a variegated appearance with blue and clear areas. The clear areas are remnants of growth plate cartilage that have never been remodelled. Based on TRAP stains, these animals do have osteoclasts (see arrows) at the growth plate (panel C), which may be reduced in number. However, the surfaces of the trabeculae away from the growth plate are virtually devoid of osteoclasts (D), a finding that stands in direct contrast with the control animals (see Plate A, Panel D).

FIGS. 8A-8D. HE-Osteoprotegerin expressors do not have a defect in monocyte-macrophage development. One cause for osteopetrosis in mice is defective M-CSF production due to a point mutation in the M-CSF gene. This results in a marked deficit of circulating and tissue based macrophages. The peripheral blood of Osteoprotegerin expressors contained monocytes as assessed by H1E analysis. To affirm the presence of tissue macrophages, immunohistochemistry was performed using F480 antibodies, which recognize a cell surface antigen on murine macrophages. Panels A and C show low power (4×) photomicrographs of the spleens from normal and CR1 overexpressors. Note that both animals have numerous F480 positive cells. Monocyte-macrophages were also present in the marrow of normal (B) and HE-Osteoprotegerin overexpressors (D) (40×).

FIGS. 9A-9C. Structure and sequence of mouse and human Osteoprotegerin cDNA clones. A. Mouse cDNA and protein sequence. B. Human cDNA and protein sequence. The predicted signal peptides are underlined, and potential sites of N-linked glycosylation are indicated in bold. C. Sequence alignment and comparison of rat, mouse and human Osteoprotegerin amino acid sequences.

FIGS. 10A-10C. Expression and secretion of full length and truncated mouse Osteoprotegerin-Fc fusion proteins. A. Map in which points of fusion to the human IgG1 Fc domain are indicated by arrowheads. B. Silver stain of an SDS-polyacrylamide gel of conditioned media obtained from Fl.Fc (full length Osteoprotegerin fused to Fc at leucine 401) and CT.Fc (carboxy-terminal truncated Osteoprotegerin fused to Fc at threonine 180) fusion protein expression vectors. Lane 1, parent pCEP4 expression vector cell line; Lane 2, Fl.Fc vector cell line; Lane 3, CT.Fc vector cell line. C. Western blot of conditioned media obtained from Fl.Fc and CT.Fc fusion protein expression vectors probed with anti-human IgG1 Fc domain (Pierce). Lane 1, parent pCEP4 expression vector cell line; Lane 2, Fl.Fc vector cell line; Lane 3, CT.Fc vector cell line.

FIGS. 11A-11B. Expression of human Osteoprotegerin in E. coli. A. Construction of a bacterial expression vector. The LORF of the human Osteoprotegerin gene was amplified by PCR, then joined to an oligonucleotide linker fragment, and ligated into pAMG21 vector DNA. The resulting vector is capable of expressing Osteoprotegerin residues 32-401 linked to an N-terminal methionine residue. B. SDS-PAGE analysis of uninduced and induced bacteria harboring the pAMG21-human Osteoprotegerin-32-401 plasmid. Lane 1, MW standards; lane 2, uninduced bacteria; lane 3, 30° C. induction; lane 4, 37° C. induction; lane 5, whole cell lysate from 37° C. induction; lane 6, soluble fraction of whole cell lysate; lane 7, insoluble fraction of whole cell lysate; lane 8, purified inclusion bodies obtained from whole cell lysate.

DETAILED DESCRIPTION OF THE INVENTION

A novel member of the tumor necrosis factor receptor superfamily was identified as an expressed sequence tag (EST) isolated from a fetal rat intestinal cDNA library. The structures of the full-length rat cDNA clones and the corresponding mouse and human cDNA clones were determined as described in Examples 1 and 6. The rat, mouse and human genes are shown in FIGS. 2A, 9A and 9B, respectively. All three sequences showed strong similarity to the extracellular domains of TNFR family members. None of the full-length cDNA clones isolated encoded transmembrane and cytoplasmic domains that would be expected for membrane-bound receptors, suggesting that these cDNAs encode soluble, secreted proteins rather than cell surface receptors. A portion of the human gene spanning nucleotides 1200-1353 shown in FIG. 9B was deposited in the GenBank database on Nov. 22, 1995 under accession no. 17188769.

The tissue distribution of the rat and human mRNA was determined as described in Example 2. In rat, mRNA expression was detected in kidney, liver, placenta and heart with the highest expression in the kidney. Expression in skeletal muscle and pancreas was also detected. In humans, expression was detected in the same tissues along with lymph node, thymus, spleen and appendix.

The rat cDNA was expressed in transgenic mice (Example 3) using the liver-specific ApoE promoter expression system. Analysis of expressors showed a marked increase in bone density, particularly in long bones (femurs), vertebrae and flat bones (pelvis). Histological analysis of stained sections of bone showed severe osteopetrosis (see Example 4), indicating a marked imbalance between bone formation and resorption which has led to a marked accumulation of bone and cartilage. A decrease in the number of trabecular osteoclasts in the bones of Osteoprotegerin expressor animals indicates that a significant portion of the activity of the TNFR-related protein may be to prevent bone resorption, a process mediated by osteoclasts. In view of the activity in transgenic expressors, the TNFR-related proteins described herein are termed osteoprotegerins.

Using the rat cDNA sequence, mouse and human cDNA clones were isolated (Example 5). Expression of mouse Osteoprotegerin in 293 cells and human Osteoprotegerin in E. coli is described in Examples 6 and 7. Mouse Osteoprotegerin was produced as an Fc fusion which was purified by Protein A affinity chromatography.

Osteoprotegerin may be important in regulating bone resorption. The protein appears to act as a soluble receptor of the TNF family and may prevent a receptor-ligand interaction involved in the osteolytic pathway. One aspect of the regulation appears to be a reduction in the number of osteoclasts.

Nucleic Acids

The invention provides for an isolated nucleic acid encoding a polypeptide having at least one of the biological activities of osteoprotegerin. As described herein, the biological activities of Osteoprotegerin include, but are not limited to, any activity involving bone metabolism and in particular, include increasing bone density. The nucleic acids of the invention are selected from the following:

a) the nucleic acid sequences as shown in FIGS. 2B, 9A and 9B or complementary strands thereof;

b) the nucleic acids which hybridize under stringent conditions with the polypeptide-encoding region in FIGS. 2B, 9A and 9B;

c) nucleic acids which hybridize under stringent conditions with nucleotides 148 through 337 inclusive as shown in FIG. 2B; and

d) the nucleic acid sequences which are degenerate to the sequences in (a) and (b).
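The degeneracy referred to in (d) can be illustrated with a minimal Python sketch. The two DNA strings and the five-residue peptide are hypothetical and are not taken from the Osteoprotegerin sequences of the figures; the codon table is a small subset of the standard genetic code covering only the codons used here.

```python
# Subset of the standard genetic code: only the codons needed for this example.
CODON_TABLE = {
    "ATG": "M",
    "AAC": "N", "AAT": "N",
    "AAA": "K", "AAG": "K",
    "TGG": "W",
    "CTG": "L", "TTA": "L",
}


def translate(dna):
    """Translate a coding-strand DNA string, codon by codon, into one-letter amino acids."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two hypothetical coding sequences that differ at three codons.
seq_a = "ATGAACAAATGGCTG"
seq_b = "ATGAATAAGTGGTTA"

# Degenerate sequences differ at the nucleotide level but encode the same polypeptide.
same_polypeptide = translate(seq_a) == translate(seq_b)
```

Here `seq_a` and `seq_b` differ in nucleotide sequence yet translate to the same peptide, which is the sense in which one sequence is degenerate to the other.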

The invention provides for nucleic acids which encode rat, mouse and human Osteoprotegerin as well as nucleic acid sequences hybridizing thereto which encode a polypeptide having at least one of the biological activities of Osteoprotegerin. The invention also provides for nucleic acids which hybridize to a rat Osteoprotegerin EST encompassing nucleotides 148-337 as shown in FIG. 2B. The conditions for hybridization are generally of high stringency such as 5×SSC, 50% formamide and 42° C. as described in Example 1 of the specification. Equivalent stringency to these conditions may be readily obtained by adjusting salt and organic solvent concentrations and temperature. The nucleic acids in (b) encompass sequences encoding Osteoprotegerin-related polypeptides which do not undergo detectable hybridization with other known members of the TNF receptor superfamily. In a preferred embodiment, the nucleic acids are as shown in FIGS. 2B, 9A and 9B.
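As a rough illustration of how salt concentration, formamide concentration and temperature trade off, the following Python sketch applies a widely used empirical approximation for the melting temperature of DNA-DNA hybrids longer than about 100 base pairs: Tm = 81.5 + 16.6 log10[Na+] + 0.41(%G+C) - 0.61(%formamide) - 500/L. This approximation is not part of the specification. The 50% G+C content is an assumed placeholder, the 190-nucleotide length corresponds to nucleotides 148 through 337 inclusive, and the sodium concentration assumes that 1×SSC is 0.15 M NaCl plus 0.015 M trisodium citrate.

```python
import math


def ssc_sodium_molar(fold):
    """Approximate Na+ molarity of SSC at the given fold concentration.

    Assumes 1x SSC = 0.15 M NaCl + 0.015 M trisodium citrate (three Na+ per citrate).
    """
    return fold * (0.15 + 3 * 0.015)


def hybrid_tm_celsius(na_molar, gc_percent, formamide_percent, length_bp):
    """Empirical melting temperature estimate for a long DNA-DNA hybrid."""
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_bp)


# Conditions named in the text: 5x SSC, 50% formamide, 42 degrees C.
# gc_percent=50 is an assumption; the actual G+C content of the probe is not given here.
probe_length = 337 - 148 + 1
tm_estimate = hybrid_tm_celsius(ssc_sodium_molar(5), 50.0, 50.0, probe_length)

# Degrees below the estimated Tm at which hybridization is carried out.
margin_below_tm = tm_estimate - 42.0
```

Under these assumptions, lowering the salt concentration or raising the formamide concentration lowers the estimated Tm, so the same margin below Tm, and hence comparable stringency, would be reached at a lower incubation temperature.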

DNA encoding rat Osteoprotegerin was provided in plasmid pMOB-B1.1 deposited with the American Type Culture Collection, Rockville, Md. on Dec. 27, 1995 under ATCC accession no. 69970. DNA encoding mouse Osteoprotegerin was provided in plasmid pRcCMV-murine Osteoprotegerin deposited with the American Type Culture Collection, Rockville, Md. on Dec. 27, 1995 under accession no. 69971. DNA encoding human Osteoprotegerin was provided in plasmid pRcCMV-human Osteoprotegerin deposited with the American Type Culture Collection, Rockville, Md. on Dec. 27, 1995 under accession no. 69969. Further, the nucleic acids of the invention will hybridize under stringent conditions to the DNA inserts of ATCC accession nos. 69969, 69970, and 69971 and have at least one of the biological activities of osteoprotegerin.

Also provided by the invention are derivatives of the nucleic acid sequences as shown in FIGS. 2B, 9A and 9B. As used herein, derivatives include nucleic acid sequences having addition, substitution, insertion or deletion of one or more residues such that the resulting sequences encode polypeptides having one or more amino acid residues which have been added, deleted, inserted or substituted and the resulting polypeptide has the activity of Osteoprotegerin. The nucleic acid derivatives may be naturally occurring, such as by splice variation or polymorphism, or may be constructed using site-directed mutagenesis techniques available to the skilled worker. It is anticipated that nucleic acid derivatives will encode amino acid changes in regions of the molecule which are least likely to disrupt biological activity. Other derivatives include a nucleic acid encoding a membrane-bound form of Osteoprotegerin having an extracellular domain as shown in FIGS. 2B, 9A and 9B along with transmembrane and cytoplasmic domains.

Examples of the nucleic acids of the invention include cDNA, genomic DNA, synthetic DNA and RNA. cDNA is obtained from libraries prepared from mRNA isolated from various tissues expressing Osteoprotegerin. In humans, tissue sources for Osteoprotegerin include kidney, liver, placenta and heart. Genomic DNA encoding Osteoprotegerin is obtained from genomic libraries which are commercially available from a variety of species. Synthetic DNA is obtained by chemical synthesis of overlapping oligonucleotide fragments followed by assembly of the fragments to reconstitute part or all of the coding region and flanking sequences (see U.S. Pat. No. 4,695,623 describing the chemical synthesis of interferon genes). RNA is obtained most easily by procaryotic expression vectors which direct high-level synthesis of mRNA, such as vectors using T7 promoters and RNA polymerase.

Nucleic acid sequences of the invention are used for the detection of Osteoprotegerin sequences in biological samples in order to determine which cells and tissues are expressing Osteoprotegerin mRNA. The sequences may also be used to screen cDNA and genomic libraries for sequences related to Osteoprotegerin. Such screening is well within the capabilities of one skilled in the art using appropriate hybridization conditions to detect homologous sequences. The nucleic acids are also useful for modulating Osteoprotegerin expression levels by anti-sense therapy or gene therapy. The nucleic acids are also used for the development of transgenic animals which may be used for the production of the polypeptide and for the study of biological activity (see Example 3).

Vectors and Host Cells

Expression vectors containing nucleic acid sequences encoding Osteoprotegerin, host cells transformed with said vectors and methods for the production of Osteoprotegerin are also provided by the invention. An overview of expression of recombinant proteins is found in Methods in Enzymology v. 185 (Goeddel, D. V. ed.) Academic Press (1990).

Host cells for the production of Osteoprotegerin include procaryotic host cells, such as E. coli, as well as yeast, plant, insect and mammalian host cells. E. coli strains such as HB101 or JM101 are suitable for expression. Preferred mammalian host cells include COS, CHOd-, 293, CV-1, 3T3, baby hamster kidney (BHK) cells and others. Mammalian host cells are preferred when post-translational modifications, such as glycosylation and polypeptide processing, are important for Osteoprotegerin activity. Mammalian expression allows for the production of secreted polypeptides which may be recovered from the growth medium.

Vectors for the expression of Osteoprotegerin contain at a minimum sequences required for vector propagation and for expression of the cloned insert. These sequences include a replication origin, selection marker, promoter, ribosome binding site, enhancer sequences, RNA splice sites and transcription termination site. Vectors suitable for expression in the aforementioned host cells are readily available and the nucleic acids of the invention are inserted into the vectors using standard recombinant DNA techniques. Vectors for tissue-specific expression of Osteoprotegerin are also included. Such vectors include promoters which function specifically in liver, kidney or other organs for production in mice, and viral vectors for the expression of Osteoprotegerin in targeted human cells.

Using an appropriate host-vector system, Osteoprotegerin is produced recombinantly by culturing a host cell transformed with an expression vector containing nucleic acid sequences encoding Osteoprotegerin under conditions such that Osteoprotegerin is produced, and isolating the product of expression. Osteoprotegerin is produced in the supernatant of transfected mammalian cells or in inclusion bodies of transformed bacterial host cells. Osteoprotegerin so produced may be purified by procedures known to one skilled in the art as described below. The expression of Osteoprotegerin in mammalian and bacterial host systems is described in Examples 6 and 7. It is anticipated that the specific plasmids and host cells described are for illustrative purposes and that other available plasmids and host cells could also be used to express the polypeptides.

The invention also provides for expression of Osteoprotegerin from endogenous nucleic acids by in vivo or ex vivo recombination events to allow modulation of Osteoprotegerin from the host chromosome.

Polypeptides

The invention provides for Osteoprotegerin, a novel member of the TNF receptor superfamily, having an activity associated with bone metabolism and in particular having the activity of inhibiting bone resorption, thereby increasing bone density. Osteoprotegerin refers to a polypeptide having an amino acid sequence of mouse, rat or human Osteoprotegerin or a derivative thereof having at least one of the biological activities of Osteoprotegerin. The amino acid sequences of rat, mouse and human Osteoprotegerin are shown in FIGS. 2B, 9A and 9B, respectively. A derivative of Osteoprotegerin refers to a polypeptide having an addition, deletion, insertion or substitution of one or more amino acids such that the resulting polypeptide has at least one of the biological activities of Osteoprotegerin. The biological activities of Osteoprotegerin include, but are not limited to, activities involving bone metabolism. Preferably, the polypeptides will have the amino terminal leader sequence of 21 amino acids removed.

Osteoprotegerin polypeptides encompassed by the invention include rat [1-401], rat [22-180], rat [22-401], rat [22-401]-Fc fusion, rat [1-180]-Fc fusion, mouse [1-401], mouse [1-180], mouse [22-401], human [1-401], mouse [22-180], human [22-401], human [22-180], human [1-180], human [22-180]-Fc fusion and human met-32-401. Amino acid numbering is as shown in SEQ ID NO: 2 (rat), SEQ ID NO: 4 (mouse) and SEQ ID NO: 6 (human). Also encompassed are polypeptide derivatives having deletions or carboxy-terminal truncations of part or all of amino acid residues 180-401 of Osteoprotegerin; one or more amino acid changes in residues 180-401; deletion of part or all of a cysteine-rich domain of Osteoprotegerin, in particular deletion of the distal (carboxy-terminal) cysteine-rich domain; and one or more amino acid changes in a cysteine-rich domain, in particular in the distal (carboxy-terminal) cysteine-rich domain.
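The bracketed residue ranges above are 1-based and inclusive. A minimal Python sketch of how such fragments map onto a full-length sequence follows. The 401-residue string is a synthetic placeholder rather than any actual Osteoprotegerin sequence, and the function names are hypothetical.

```python
def fragment(sequence, first, last):
    """Return residues first..last of a polypeptide, using 1-based inclusive numbering."""
    if not 1 <= first <= last <= len(sequence):
        raise ValueError("residue range lies outside the sequence")
    return sequence[first - 1:last]


def met_fragment(sequence, first, last):
    """Return the fragment with an added N-terminal methionine (procaryotic expression)."""
    return "M" + fragment(sequence, first, last)


# Synthetic placeholder of the stated full length; NOT the real sequence.
PLACEHOLDER = "M" + "X" * 400

full_length = fragment(PLACEHOLDER, 1, 401)      # [1-401], leader sequence included
mature = fragment(PLACEHOLDER, 22, 401)          # [22-401], 21-residue leader removed
truncated = fragment(PLACEHOLDER, 22, 180)       # [22-180], carboxy-terminal truncation
met_32_401 = met_fragment(PLACEHOLDER, 32, 401)  # met-32-401
```

With this numbering, [22-401] contains 380 residues, [22-180] contains 159 residues, and met-32-401 contains 371 residues including the added methionine.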

Modifications of Osteoprotegerin polypeptides are encompassed by the invention and include post-translational modifications (e.g., N-linked or O-linked carbohydrate chains, processing of N-terminal or C-terminal ends), attachment of chemical moieties to the amino acid backbone, chemical modifications of N-linked or O-linked carbohydrate chains, and addition of an N-terminal methionine residue as a result of procaryotic host cell expression. The polypeptides may also be modified with a detectable label, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection and isolation of the protein.

Further modifications of Osteoprotegerin include chimeric proteins wherein Osteoprotegerin is fused to a heterologous amino acid sequence. The heterologous sequence may be any sequence which allows the resulting fusion protein to retain the activity of Osteoprotegerin. The heterologous sequences include for example, immunoglobulin fusions, such as Fc fusions, which may aid in purification of the protein.

The polypeptides of the invention are isolated and purified from other polypeptides present in tissues, cell lines and transformed host cells expressing Osteoprotegerin, or purified from components in cell cultures containing the secreted protein. In one embodiment, the polypeptide is free from association with other human proteins, such as the expression product of a bacterial host cell.

Also provided by the invention are chemically modified derivatives of Osteoprotegerin which may provide additional advantages such as increasing stability and circulating time of the polypeptide, or decreasing immunogenicity (see U.S. Pat. No. 4,179,337). The chemical moieties for derivatization may be selected from water soluble polymers such as polyethylene glycol, ethylene glycol/propylene glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The polypeptides may be modified at random positions within the molecule, or at predetermined positions within the molecule and may include one, two, three or more attached chemical moieties.

A method for the purification of Osteoprotegerin from natural sources and from transfected host cells is also included. The purification process may employ one or more standard protein purification steps in an appropriate order to obtain purified protein. The chromatography steps can include ion exchange, gel filtration, hydrophobic interaction, reverse phase, chromatofocusing, affinity chromatography employing an anti-Osteoprotegerin antibody or biotin-streptavidin affinity complex and the like.

Antibodies

Also encompassed by the invention are antibodies specifically binding to Osteoprotegerin. Antigens for the generation of antibodies may be full-length polypeptides or peptides spanning a portion of the Osteoprotegerin sequence. Immunological procedures for the generation of polyclonal or monoclonal antibodies reactive with Osteoprotegerin are known to one skilled in the art (see, for example, Harlow and Lane Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory Press, Cold Spring Harbor N.Y. (1988)). Antibodies so produced are characterized for binding specificity and epitope recognition using standard enzyme-linked immunosorbent assays. Antibodies also include chimeric antibodies having variable and constant domain regions derived from different species. In one embodiment, the chimeric antibodies are humanized antibodies having murine variable domains and human constant domains. Also encompassed are complementarity determining regions grafted to a human framework (so-called CDR-grafted antibodies). Chimeric and CDR-grafted antibodies are made by recombinant methods known to one skilled in the art. Also encompassed are human antibodies made in mice.

Anti-Osteoprotegerin antibodies of the invention may be used as an affinity reagent to purify Osteoprotegerin from biological samples. In one method, the antibody is immobilized on CNBr-activated Sepharose and a column of antibody-Sepharose conjugate is used to remove Osteoprotegerin from liquid samples. Antibodies are also used as diagnostic reagents to detect and quantitate Osteoprotegerin in biological samples by methods described below.

Pharmaceutical Compositions

The invention also provides for pharmaceutical compositions comprising a therapeutically effective amount of the polypeptide of the invention together with a pharmaceutically acceptable diluent, carrier, solubilizer, emulsifier, preservative and/or adjuvant. The term "therapeutically effective amount" means an amount which provides a therapeutic effect for a specified condition and route of administration. The composition may be in a liquid or lyophilized form and comprises a diluent (Tris, acetate or phosphate buffers) having various pH values and ionic strengths, solubilizers such as Tween or Polysorbate, carriers such as human serum albumin or gelatin, preservatives such as thimerosal or benzyl alcohol, and antioxidants such as ascorbic acid or sodium metabisulfite. Also encompassed are compositions comprising Osteoprotegerin modified with water soluble polymers to increase solubility or stability. Compositions may also comprise incorporation of Osteoprotegerin into liposomes, microemulsions, micelles or vesicles for controlled delivery over an extended period of time. Selection of a particular composition will depend upon a number of factors, including the condition being treated, the route of administration and the pharmacokinetic parameters desired. A more extensive survey of components suitable for pharmaceutical compositions is found in Remington's Pharmaceutical Sciences, 18th ed. A. R. Gennaro, ed. Mack, Easton, Pa. (1980).

Compositions of the invention may be administered by injection, either subcutaneous, intravenous or intramuscular, or by oral, nasal, pulmonary or rectal administration. The route of administration eventually chosen will depend upon a number of factors and may be ascertained by one skilled in the art.

The invention also provides for pharmaceutical compositions comprising a therapeutically effective amount of the nucleic acids of the invention together with a pharmaceutically acceptable adjuvant. Nucleic acid compositions will be suitable for the delivery of part or all of the Osteoprotegerin coding region to cells and tissues as part of an anti-sense or gene therapy regimen.

Methods of Treatment

Bone tissue provides support for the body and consists of mineral (largely calcium and phosphorous), a matrix of collagenous and noncollagenous proteins, and cells. Three types of cells found in bone, osteocytes, osteoblasts and osteoclasts, are involved in the dynamic process by which bone is continually formed and resorbed. Osteoblasts promote formation of bone tissue whereas osteoclasts are associated with resorption. Resorption, or the dissolution of bone matrix and mineral, is a fast and efficient process compared to bone formation and can release large amounts of mineral from bone. Osteoclasts are involved in the regulation of the normal remodeling of skeletal tissue and in resorption induced by hormones. For instance, resorption is stimulated by the secretion of parathyroid hormone in response to decreasing concentrations of calcium ion in extracellular fluids. In contrast, inhibition of resorption is the principal function of calcitonin. In addition, metabolites of vitamin D alter the responsiveness of bone to parathyroid hormone and calcitonin.

After skeletal maturity, the amount of bone in the skeleton reflects the balance (or imbalance) of bone formation and bone resorption. Peak bone mass occurs after skeletal maturity prior to the fourth decade. Between the fourth and fifth decades, the equilibrium shifts and bone resorption dominates. The inevitable decrease in bone mass with advancing years starts earlier in females than males and is distinctly accelerated after menopause in some females (principally those of Caucasian and Asian descent).

Osteopenia is a condition relating generally to any decrease in bone mass to below normal levels. Such a condition may arise from a decrease in the rate of bone synthesis or an increase in the rate of bone destruction or both. The most common form of osteopenia is primary osteoporosis, also referred to as postmenopausal and senile osteoporosis. This form of osteoporosis is a consequence of the universal loss of bone with age and is usually a result of increase in bone resorption with a normal rate of bone formation. About 25 to 30 percent of all white females in the United States develop symptomatic osteoporosis. A direct relationship exists between osteoporosis and the incidence of hip, femoral, neck and inter-trochanteric fracture in women 45 years and older. Elderly males develop symptomatic osteoporosis between the ages of 50 and 70, but the disease primarily affects females.

The cause of postmenopausal and senile osteoporosis is unknown. Several factors have been identified which may contribute to the condition. They include alteration in hormone levels accompanying aging and inadequate calcium consumption attributed to decreased intestinal absorption of calcium and other minerals. Treatments have usually included hormone therapy or dietary supplements in an attempt to retard the process. To date, however, an effective treatment for bone loss does not exist.

The invention provides for a method of treating a bone disorder using a therapeutically effective amount of Osteoprotegerin. The bone disorder may be any disorder characterized by a net bone loss (osteopenia or osteolysis). In general, treatment with Osteoprotegerin is anticipated when it is necessary to suppress the rate of bone resorption. Thus treatment may be done to reduce the rate of bone resorption where the resorption rate is above normal or to reduce bone resorption to below normal levels in order to compensate for below normal levels of bone formation.

Conditions which are treatable with osteoprotegerin include the following:

Osteoporosis, such as primary osteoporosis, endocrine osteoporosis (hyperthyroidism, hyperparathyroidism, Cushing's syndrome, and acromegaly), hereditary and congenital forms of osteoporosis (osteogenesis imperfecta, homocystinuria, Menkes' syndrome, and Riley-Day syndrome) and osteoporosis due to immobilization of extremities.

Paget's disease of bone (osteitis deformans) in adults and juveniles

Osteomyelitis, or an infectious lesion in bone, leading to bone loss.

Hypercalcemia resulting from solid tumors (breast, lung and kidney) and hematologic malignancies (multiple myeloma, lymphoma and leukemia); idiopathic hypercalcemia, and hypercalcemia associated with hyperthyroidism and renal function disorders.

Osteopenia following surgery, induced by steroid administration, and associated with disorders of the small and large intestine and with chronic hepatic and renal diseases.

Osteonecrosis, or bone cell death, associated with traumatic injury or nontraumatic necrosis associated with Gaucher's disease, sickle cell anemia, systemic lupus erythematosus and other conditions.

Bone loss due to rheumatoid arthritis.

Periodontal bone loss.

Osteolytic metastasis

It is understood that Osteoprotegerin may be used alone or in conjunction with other factors for the treatment of bone disorders. In one embodiment, Osteoprotegerin is used in conjunction with a therapeutically effective amount of a factor which stimulates bone formation. Such factors include but are not limited to the bone morphogenic factors designated BMP-1 through BMP-12, transforming growth factor -β (TGF-β) and TGF-β family members, interleukin-1 inhibitors, TNFα inhibitors, parathyroid hormone and analogs thereof, parathyroid related protein and analogs thereof, E series prostaglandins, bisphosphonates (such as alendronate and others), and bone-enhancing minerals such as fluoride and calcium.

The following examples are offered to more fully illustrate the invention, but are not to be construed as limiting the scope thereof.

EXAMPLE 1 Identification and Isolation of the Rat Osteoprotegerin cDNA

Materials and methods for cDNA cloning and analysis are described in Maniatis et al. Molecular Cloning, 2d ed., CSHL Press (1989). A cDNA library was constructed using mRNA isolated from embryonic d20 intestine for EST analysis (Adams et al. Science 252: 1651-1656 (1991)). Rat embryos were dissected, and the entire developing small and large intestine removed and washed in PBS. Total cell RNA was purified by acid guanidinium thiocyanate-phenol-chloroform extraction (Chomczynski and Sacchi Anal. Biochem. 162, 156-159, (1987)). The poly (A+) mRNA fraction was obtained from the total RNA preparation by adsorption to, and elution from, Dynabeads Oligo (dT)25 (Dynal Corp) using the manufacturer's recommended procedures. A random primed cDNA library was prepared using the Superscript Plasmid System (Gibco BRL, Gaithersburg, Md.). The random cDNA primer containing an internal Not I restriction site was used to initiate first strand synthesis and had the following sequence:

5'-AAAGGAAGGAAAAAAGCGGCCGCTACANNNNNNNNT-3' (SEQ ID No:7; the internal GCGGCCGC is the Not I site)

For the first strand synthesis three separate reactions were assembled that contained 2.5 ug of poly(A) RNA and 120 ng, 360 ng or 1,080 ng of random primer. After second strand synthesis, the reaction products were separately extracted with a mixture of phenol:chloroform:isoamyl alcohol (25:24:1 ratio), and then ethanol precipitated. The double strand (ds) cDNA products of the three reactions were combined and ligated to the following ds oligonucleotide adapter:

5'-TCGACCCACGCGTCCG-3' (SEQ ID No:8)

3'-GGGTGCGCAGGCp-5' (SEQ ID No:9)
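The geometry of this adapter can be checked with a short sketch. The Python below is an illustration only and not part of the original protocol; the helper name reverse_complement is hypothetical. It anneals SEQ ID No:9 to SEQ ID No:8 and reports the single-stranded 5' overhang left on the adapter.

```python
# Illustrative check of the ds adapter formed by SEQ ID No:8 and SEQ ID No:9.
# Helper names are hypothetical; the sequences are copied from the text above.

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

top = "TCGACCCACGCGTCCG"        # SEQ ID No:8, written 5'->3'
bottom_3to5 = "GGGTGCGCAGGC"    # SEQ ID No:9 as printed, 3'->5'
bottom = bottom_3to5[::-1]      # the same strand rewritten 5'->3'

# The bottom strand should base-pair with the 3' end of the top strand.
paired_region = reverse_complement(bottom)
assert top.endswith(paired_region)

# Whatever remains unpaired at the 5' end of the top strand is the overhang.
overhang = top[: len(top) - len(paired_region)]
print("paired region:", paired_region)
print("5' overhang:  ", overhang)
```

The four-base overhang reported here is the cohesive end that Sal I (G^TCGAC) leaves on digested DNA, which is consistent with the directional ligation into Not I and Sal I digested vector described in the next paragraph.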

After ligation the cDNA was digested to completion with Not I, extracted with phenol:chloroform:isoamyl alcohol (25:24:1) and ethanol precipitated. The resuspended cDNA was then size fractionated by gel filtration using premade columns provided with the Superscript Plasmid System (Gibco BRL, Gaithersburg, Md.) as recommended by the manufacturer. The two fractions containing the largest cDNA products were pooled, ethanol precipitated and then directionally ligated into Not I and Sal I digested pMOB vector DNA (Strathmann et al, 1991). The ligated cDNA was introduced into competent ElectroMAX DH10B E. coli (Gibco BRL, Gaithersburg, Md.) by electroporation. For automated sequence analysis approximately 10,000 transformants were plated on 20 cm×20 cm agar plates containing ampicillin supplemented LB nutrient media. The colonies that arose were picked and arrayed onto 96 well microtiter plates containing 200 μl of L-broth, 7.5% glycerol, and 50 μg/ml ampicillin. The cultures were grown overnight at 37° C., a duplicate set of microtiter plates was made using a sterile 96 pin replicating tool, then both sets were stored at -80° C. for further analysis. For full-length cDNA cloning approximately one million transformants were plated on 96 bacterial ampicillin plates containing about 10,000 clones each. The plasmid DNA from each pool was separately isolated using the Qiagen Plasmid Maxi Kit (Qiagen Corp., Germany) and arrayed into 96 microtiter plates for PCR analyses.

To sequence random fetal rat intestine cDNA clones, glycerol stocks were thawed, and small aliquots diluted 1:25 in distilled water. Approximately 3.0 ul of diluted bacterial cultures were added to a PCR reaction mixture (Boehringer-Mannheim) containing the following oligonucleotides:

5'-TGTAAAACGACGGCCAGT-3' (SEQ ID No:10)

5'-CAGGAAACAGCTATGACC-3' (SEQ ID No:11)

The reactions were incubated in a thermocycler (Perkin-Elmer 9600) with the following cycle conditions: 94° C. for 2 minutes; 30 cycles of 94° C. for 5 seconds, 50° C. for 5 seconds, and 72° C. for 3 minutes; 72° C. for 4 minutes. After incubation in the thermocycler, the reactions were diluted with 2.0 mL of water. The amplified DNA fragments were further purified using Centricon columns (Princeton Separations) using the manufacturer's recommended procedures. The PCR reaction products were sequenced on an Applied Biosystems 373A automated DNA sequencer using T3 primer (oligonucleotide 353-23; 5'-CAATTAACCCTCACTAAAGG-3') (SEQ ID No:12) and Taq dye-terminator reactions (Applied Biosystems) following the manufacturer's recommended procedures.

The resulting 5' nucleotide sequence obtained from randomly picked cDNA clones was translated and then compared to the existing database of known protein sequences using a modified version of the FASTA program (Pearson et al. Meth. Enzymol. 183, (1990)). Translated sequences were also analysed for the presence of a specific cysteine-rich protein motif found in all known members of the tumor necrosis factor receptor (TNFR) superfamily (Smith et al. Cell 76, 959-962 (1994)), using the sequence profile method of Gribskov et al. (PNAS USA 83, 4355-4359 (1987)), as modified by Luethy et al. (Protein Science 3, 139-146 (1994)).

Using the FASTA and Profile search data, an EST, FRI-1 (Fetal Rat Intestine-1), was identified as a possible new member of the TNFR superfamily. FRI-1 contained an approximately 600 bp insert with a LORF of about 150 amino acids. The closest match in the database was the human type II TNFR (TNFR-2). The region compared showed an ˜43% homology between TNFR-2 and FRI-1 over this 150 aa LORF. Profile analysis using the first and second cysteine-rich repeats of the TNFR superfamily yielded a Z score of ˜8, indicating that the FRI-1 gene possibly encodes a new family member. To deduce the structure of the FRI-1 product, the fetal rat intestine cDNA library was screened for full length clones. The following oligonucleotides were derived from the original FRI-1 sequence:

5'-GCATTATGACCCAGAAACCGGAC-3' (SEQ ID No:13)

5'-AGGTAGCGCCCTTCCTCACATTC-3' (SEQ ID No:14)

These primers were used in PCR reactions to screen 96 pools of plasmid DNA, each pool containing plasmid DNA from 10,000 independent cDNA clones. Approximately 1 ug of plasmid pool DNA was amplified in a PCR reaction mixture (Boehringer-Mannheim) using a Perkin-Elmer 96 well thermal cycler with the following cycle conditions: 2 min at 94° C., 1 cycle; 15 sec at 94° C., then 45 sec at 65° C., 30 cycles; 7 min at 65° C., 1 cycle. PCR reaction products were analysed by gel electrophoresis. 13 out of 96 plasmid DNA pools gave rise to amplified DNA products with the expected relative molecular mass.
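The pool-screening result gives a rough idea of how rare the FRI-1 clone was in the library. The sketch below is an illustration under an assumption that is not in the original text: that positive clones were distributed independently among the pools, so the number per pool follows a Poisson distribution. Under that assumption the fraction of negative pools estimates exp(-mean), from which a per-clone frequency follows.

```python
import math

# Figures taken from the screening described above.
pools_total = 96
pools_positive = 13
clones_per_pool = 10_000

# Assumption (not from the source): positives per pool are Poisson distributed,
# so P(pool is negative) = exp(-mean_positives_per_pool).
fraction_negative = (pools_total - pools_positive) / pools_total
mean_positives_per_pool = -math.log(fraction_negative)

# Dividing by the pool size gives an estimated frequency per library clone.
clone_frequency = mean_positives_per_pool / clones_per_pool

print(f"mean positive clones per pool: {mean_positives_per_pool:.3f}")
print(f"estimated clone frequency:     {clone_frequency:.2e}")
print(f"about 1 clone in {round(1 / clone_frequency):,}")
```

The estimate is only as good as the Poisson assumption and says nothing about whether each positive pool contained a full-length clone.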

DNA from one positive pool was used to transform competent ElectroMAX DH10B E. coli (Gibco BRL, Gaithersburg, Md.) as described above. Approximately 40,000 transformants were plated onto sterile nitrocellulose filters (BA-85, Schleicher and Schuell), and then screened by colony hybridization using a ³²P-dCTP labelled version of the PCR product obtained above. Filters were prehybridized in 5× SSC, 50% deionized formamide, 5× Denhardt's solution, 0.5% SDS, and 100 ug/ml denatured salmon sperm DNA for 2-4 hours at 42° C. Filters were then hybridized in 5× SSC, 50% deionized formamide, 2× Denhardt's solution, 0.1% SDS, 100 ug/ml denatured salmon sperm DNA, and ˜5 ng/ml of labelled probe for ˜18 hours at 42° C. The filters were then washed in 2× SSC for 10 min at RT, 1× SSC for 10 min at 55° C., and finally in 0.5× SSC for 10-15 min at 55° C. Hybridizing clones were detected following autoradiography, and then replated onto nitrocellulose filters for secondary screening. Upon secondary screening, a plasmid clone (pB1.1) was isolated, then amplified in L-broth media containing 100 ug/ml ampicillin and the plasmid DNA obtained. Both strands of the 2.4 kb pB1.1 insert were sequenced.

The pB1.1 insert sequence was used for a FASTA search of the public database to detect any existing sequence matches and/or similarities. No matches to any known genes or ESTs were found, although there was approximately 45% similarity to the human and mouse TNFR-2 genes. A methionine start codon is found at bp 124 of the nucleotide sequence, followed by a LORF encoding 401 aa residues that terminates at bp 1327. The 401 aa residue product is predicted to have a hydrophobic signal peptide of approximately 31 residues at its N-terminus, and 4 potential sites of N-linked glycosylation. No hydrophobic transmembrane spanning sequence was identified using the PepPlot program (Wisconsin GCG package, version 8.1). The deduced 401 aa sequence was then used to search the protein database. Again, there were no existing matches, although there appeared to be a strong similarity to many members of the TNFR superfamily, most notably the human and mouse TNFR-2. A sequence alignment of this novel protein with known members of the TNFR-superfamily was prepared using the Pileup program, and then modified by PrettyPlot (Wisconsin GCG package, version 8.1). This alignment shows a clear homology between the full length FRI-1 gene product and all other TNFR family members. The homologous region maps to the extracellular domain of TNFR family members, and corresponds to the three or four cysteine-rich repeats found in the ligand binding domain of these proteins. This suggested that the FRI-1 gene encoded a novel TNFR family member. Since no transmembrane spanning region was detected we predicted that this may be a secreted receptor, similar to TNFR-1 derived soluble receptors (Kohno et al. PNAS USA 87, 8331-8335 (1990)). Due to the apparent biological activity of the FRI-1 gene (vide infra), the product was named Osteoprotegerin.
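The reading-frame coordinates quoted above can be cross-checked with simple arithmetic. The sketch below is illustrative and rests on one assumption not stated in the text: that bp 124 is the first base of the ATG and bp 1327 is the first base of the stop codon.

```python
# Coordinates quoted in the text for the pB1.1 insert.
start_bp = 124               # assumed: first base of the ATG start codon
stop_bp = 1327               # assumed: first base of the stop codon
predicted_signal_peptide = 31  # approximate length quoted in the text

# Bases from the start codon up to (not including) the stop codon.
coding_bases = stop_bp - start_bp
codons, remainder = divmod(coding_bases, 3)

# Length of the product if the predicted signal peptide were removed.
predicted_mature_length = codons - predicted_signal_peptide

print(f"coding bases: {coding_bases}, codons: {codons}, remainder: {remainder}")
print(f"predicted mature length: {predicted_mature_length} residues")
```

A remainder of zero and a codon count equal to the stated LORF length indicate that the two coordinates and the 401 aa figure are mutually consistent under this reading.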

EXAMPLE 2 Osteoprotegerin mRNA Expression Patterns in Tissues

Multiple human tissue northern blots (Clontech) were probed with a ³²P-dCTP labelled FRI-1 PCR product to detect the size of the human transcript and to determine patterns of expression. Northern blots were prehybridized in 5× SSPE, 50% formamide, 5× Denhardt's solution, 0.5% SDS, and 100 ug/ml denatured salmon sperm DNA for 2-4 hr at 42° C. The blots were then hybridized in 5× SSPE, 50% formamide, 2× Denhardt's solution, 0.1% SDS, 100 ug/ml denatured salmon sperm DNA, and 5 ng/ml labelled probe for 18-24 hr at 42° C. The blots were then washed in 2× SSC for 10 min at RT, 1× SSC for 10 min at 50° C., then in 0.5× SSC for 10-15 min.

Using a probe derived from the rat gene, a predominant mRNA species with a relative molecular mass of about 2.4 kb is detected in several tissues, including kidney, liver, placenta, and heart. Highest levels are detected in the kidney. Larger mRNA species of 4.5 and 7.5 kb were detected in skeletal muscle and pancreas. In human fetal tissue, kidney was found to express relatively high levels of the 2.4 kb mRNA. Using a human probe (vide infra), only the 2.4 kb transcript is detected in these same tissues. In addition, relatively high levels of the 2.4 kb transcript were detected in the lymph node, thymus, spleen and appendix. The size of the transcript detected by both the rat and human Osteoprotegerin gene probes is almost identical to the length of the rat pB1.1 FRI-1 insert, suggesting it was a full length cDNA clone.

EXAMPLE 3 Systemic Delivery of Osteoprotegerin in Transgenic Mice

The rat Osteoprotegerin clone pB1.1 was used as template to PCR amplify the coding region for subcloning into an ApoE-liver specific expression vector (Simonet et al. J. Clin. Invest. 94, 1310-1319 (1994), and PCT Application No. US94/11675 and co-owned U.S. Ser. No. 08/221,767). The following 5' and 3' oligonucleotide primers were used for PCR amplification, respectively:

5'-GACTAGTCCCACAATGAACAAGTGGCTGTG-3' (SEQ ID No:15)

5'-ATAAGAATGCGGCCGCTAAACTATGAAACAGCCCAGTGACCATTC-3' (SEQ ID No:16)

The PCR reaction mixture (Boehringer-Mannheim) was treated as follows: 94° C. for 1 minute, 1 cycle; 94° C. for 20 sec, 62° C. for 30 sec, and 74° C. for 1 minute, 25 cycles. Following amplification, the samples were purified over Qiagen PCR columns and digested overnight with SpeI and NotI restriction enzymes. The digested products were extracted and precipitated and subcloned into the ApoE promoter expression vector. Prior to microinjecting the resulting clone, HE-Osteoprotegerin, it was sequenced to ensure it was mutation-free.
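The cloning sites built into the two amplification primers can be located with a small sketch. The Python below is illustrative only; the helper find_sites is hypothetical, and the recognition sequences used (ACTAGT for SpeI, GCGGCCGC for NotI) are the standard ones for those enzymes rather than values given in this document.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"SpeI": "ACTAGT", "NotI": "GCGGCCGC"}

# Primer sequences copied from SEQ ID No:15 and SEQ ID No:16, written 5'->3'.
primers = {
    "SEQ ID No:15": "GACTAGTCCCACAATGAACAAGTGGCTGTG",
    "SEQ ID No:16": "ATAAGAATGCGGCCGCTAAACTATGAAACAGCCCAGTGACCATTC",
}

def find_sites(seq: str) -> dict:
    """Map each enzyme whose site occurs in seq to its 0-based start index."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}

for label, seq in primers.items():
    print(label, find_sites(seq))
```

Each primer carries exactly one of the two sites, with a few extra bases 5' of it, so SpeI and NotI digestion of the PCR product yields a fragment that can be inserted in a single orientation.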

The HE-Osteoprotegerin plasmid was purified through two rounds of CsCl density gradient centrifugation. The purified plasmid DNA was digested with XhoI and Ase I, and the 3.6 kb transgene insert was purified by gel electrophoresis. The purified fragment was diluted to a stock injection solution of 1 ug/ml in 5 mM Tris, pH 7.4, 0.2 mM EDTA. Single-cell embryos from BDF1×BDF1-bred mice were injected essentially as described (Brinster et al., PNAS USA 82, 4338 (1985)), except that injection needles were beveled and siliconized before use. Embryos were cultured overnight in a CO₂ incubator and 15 to 20 2-cell embryos were transferred to the oviducts of pseudopregnant CD1 female mice.

Following term pregnancy, 49 offspring were obtained from implantation of microinjected embryos. The offspring were screened by PCR amplification of the integrated transgene in genomic DNA samples. The target region for amplification was a 369 bp region of the human Apo E intron which was included in the expression vector. The oligos used for PCR amplification were:

5'-GCC TCT AGA AAG AGC TGG GAC-3' (SEQ ID No:17)

5'-CGC CGT GTT CCA TTT ATG AGC-3' (SEQ ID No:18)

The conditions for PCR were: 94° C. for 2 minutes, 1 cycle; 94° C. for 1 min, 63° C. for 20 sec, and 72° C. for 30 sec, 30 cycles. Of the 49 original offspring, 9 were identified as PCR positive transgenic founders.

At 8-10 weeks of age, five transgenic founders (2, 11, 16, 17, and 28) and five controls (1, 12, 15, 18, and 30) were sacrificed for necropsy and pathological analysis. Liver was isolated from the remaining 4 founders by partial hepatectomy. For partial hepatectomy, the mice were anesthetized and a lobe of liver was surgically removed. Total cellular RNA was isolated from livers of all transgenic founders, and 5 negative control littermates as described (McDonald et al. Meth. Enzymol. 152, 219 (1987)). Northern blot analysis was performed on these samples to assess the level of transgene expression. Approximately 10 ug of total RNA from each animal liver was resolved by electrophoresis on denaturing gels (Ogden et al. Meth. Enzymol 152, 61 (1987)), then transferred to HYBOND-N nylon membrane (Amersham), and probed with ³²P-dCTP labelled pB1.1 insert DNA. Hybridization was performed overnight at 42° C. in 50% Formamide, 5× SSPE, 0.5% SDS, 5× Denhardt's solution, 100 ug/ml denatured salmon sperm DNA and 2-4×10⁶ cpm of labeled probe/ml of hybridization buffer. Following hybridization, blots were washed twice in 2× SSC, 0.1% SDS at room temperature for 5 min each, and then twice in 0.1× SSC, 0.1% SDS at 55° C. for 5-10 min each. Expression of the transgene in founder and control littermates was determined following autoradiography.

The northern blot data indicate that 7 of the transgenic founders express detectable levels of the transgene mRNA (animals 2, 11, 16, 17, 22, 33, and 45). The negative control mice and one of the founders (#28) expressed no transgene-related mRNA. Since Osteoprotegerin is predicted to be a secreted protein, overexpression of transgene mRNA should be a proxy for the level of systemically delivered gene product. Of the PCR and northern blot positive mice, animals 2, 17 and 22 expressed the highest levels of transgene mRNA, and may show more extensive biological effects on host cells and tissues.

EXAMPLE 4 Biological Activity of Osteoprotegerin

Five of the transgenic mice (animals 2, 11, 16, 17 and 28) and 5 control littermates (animals 1, 12, 15, 18, and 30) were sacrificed for necropsy and pathological analysis using the following procedures: Prior to euthanasia, all animals had their identification numbers verified, then were weighed, anesthetized and blood drawn. The blood was saved as both serum and whole blood for a complete serum chemistry and hematology panel. Radiography was performed just after terminal anesthesia by lethal CO₂ inhalation, and prior to the gross dissection. Following this, tissues were removed and fixed in 10% buffered Zn-Formalin for histological examination. The tissues collected included the liver, spleen, pancreas, stomach, duodenum, ileum, colon, kidney, reproductive organs, skin and mammary glands, bone, brain, heart, lung, thymus, trachea, esophagus, thyroid, jejunum, cecum, rectum, adrenals, urinary bladder, and skeletal muscle. Prior to fixation the whole organ weights were determined for the liver, stomach, kidney, adrenals, spleen, and thymus. After fixation the tissues were processed into paraffin blocks, and 3 um sections were obtained. Bone tissue was decalcified using a formic acid solution, and all sections were stained with hematoxylin and eosin. In addition, staining with Gomori's reticulin and Masson's trichrome was performed on certain tissues. Enzyme histochemistry was performed to determine the expression of tartrate resistant acid phosphatase (TRAP), an enzyme highly expressed by osteoclasts, multinucleated bone-resorbing cells of monocyte-macrophage lineage. Immunohistochemistry for BrdU and F480 monocyte-macrophage surface antigen was also performed to detect replicating cells and cells of the monocyte-macrophage lineage, respectively. To detect F480 surface antigen expression, formalin fixed, paraffin embedded 4 μm sections were deparaffinized and hydrated to deionized water. The sections were quenched with 3% hydrogen peroxide, blocked with Protein Block (Lipshaw, Pittsburgh, Pa.), and incubated in rat monoclonal anti-mouse F480 (Harlan, Indianapolis, Ind.). This antibody was detected by biotinylated rabbit anti-rat immunoglobulins and peroxidase conjugated streptavidin (BioGenex, San Ramon, Calif.) with DAB as chromogen (BioTek, Santa Barbara, Calif.). Sections were counterstained with hematoxylin.

Upon gross dissection and observation of visceral tissues, no abnormalities were found in the transgene expressors or control littermates. Analysis of organ weight indicates that spleen size increased by approximately 38% in the transgenic mice relative to controls. There was a slight enlargement of platelet size and increased circulating unstained cells in the transgene expressors. There was a marginal decrease in platelet levels in the transgene expressors. In addition, the serum uric acid, urea nitrogen, and alkaline phosphatase levels all trended lower in the transgene expressors. The expressors were found to have increased radiodensity of the skeleton, including long bones (femurs), vertebrae, and flat bones (pelvis). The relative size of femurs in the expressors was not different from that in the control mice.

Histological analysis of stained sections of bone from the Osteoprotegerin expressors show severe osteopetrosis with the presence of cartilage remnants from the primary spongiosa seen within bone trabeculae in the diaphysis of the femur. A clearly defined cortex was not identifiable in the sections of femur. In normal animals, the central diaphysis is filled with bone marrow. Sections of vertebra also show osteopetrotic changes implying that the Osteoprotegerin-induced skeletal changes were systemic. The residual bone marrow showed predominantly myeloid elements. Megakaryocytes were present. Reticulin stains showed no evidence for reticulin deposition. Immunohistochemistry for F480, a cell surface antigen expressed by cells of monocyte-macrophage derivation in the mouse, showed the presence of F480 positive cells in the marrow spaces. Focally, flattened F480 positive cells could be seen directly adjacent to trabecular bone surfaces.

The mesenchymal cells lining the bony trabeculae were flattened and appeared inactive. Based on H&E and TRAP stains, osteoclasts were rarely found on the trabecular bone surfaces in the Osteoprotegerin expressors. In contrast, osteoclasts and/or chondroclasts were seen in the region of the growth plate resorbing cartilage, but their numbers may be reduced compared to controls. Also, osteoclasts were present on the cortical surface of the metaphysis where modelling activity is usually robust. The predominant difference between the expressors and controls was the profound decrease in trabecular osteoclasts, both in the vertebrae and femurs. The extent of bone accumulation was directly correlated with the level of Osteoprotegerin transgene mRNA detected by northern blotting of total liver RNA.

The spleens from the Osteoprotegerin expressors had an increased amount of red pulp with the expansion due to increased hematopoiesis. All hematopoietic lineages are represented. F480 positive cells were present in both control and Osteoprotegerin expressors in the red pulp. Two of the expressors (2 and 17) had foci of extramedullary hematopoiesis within the liver and this is likely due to the osteopetrotic marrow.

There were no observable abnormalities in the thymus, lymph nodes, gastrointestinal tract, pancreato-hepatobiliary tract, respiratory tract, reproductive system, genito-urinary system, skin, nervous system, heart and aorta, breast, skeletal muscle and fat.

EXAMPLE 5 Isolation of Mouse and Human Osteoprotegerin cDNA

A cDNA clone corresponding to the 5' end of the mouse Osteoprotegerin mRNA was isolated from a mouse kidney cDNA library (Clontech) by PCR amplification. The oligonucleotides were derived from the rat Osteoprotegerin cDNA sequence and are shown below:

5'-ATCAAAGGCAGGGCATACTTCCTG-3' (SEQ ID No:19)

5'-GTTGCACTCCTGTTTCACGGTCTG-3' (SEQ ID No:20)

5'-CAAGACACCTTGAAGGGCCTGATG-3' (SEQ ID No:21)

5'-TAACTTTTACAGAAGAGCATCAGC-3' (SEQ ID No:22)

5'-AGCGCGGCCGCATGAACAAGTGGCTGTGCTGCG-3' (SEQ ID No:23)

5'-AGCTCTAGAGAAACAGCCCAGTGACCATTCC-3' (SEQ ID No:24)

The partial and full-length cDNA products obtained in this process were sequenced. The full-length product was digested with Not I and Xba I, then directionally cloned into the plasmid vector pRcCMV (Invitrogen). The resulting plasmid was named pRcCMV-Mu-Osteoprotegerin. The nucleotide sequence of the cloned product was compared to the rat Osteoprotegerin cDNA sequence. Over the 1300 bp region spanning the Osteoprotegerin LORF, the rat and mouse DNA sequences are approximately 88% identical. The mouse cDNA sequence contained a 401 aa LORF, which was compared to the rat Osteoprotegerin protein sequence and found to be ˜94% identical without gaps. This indicates that the mouse cDNA sequence isolated encodes the murine Osteoprotegerin protein, and that the sequence and structure has been highly conserved throughout evolution. The mouse Osteoprotegerin protein sequence contains an identical putative signal peptide at its N-terminus, and all 4 potential sites of N-linked glycosylation are conserved.

A partial human Osteoprotegerin cDNA was cloned from a human kidney cDNA library using the following rat-specific oligonucleotides:

5'-GTG AAG CTG TGC AAG AAC CTG ATG-3' (SEQ ID No:25)

5'-ATC AAA GGC AGG GCA TAC TTC CTG-3' (SEQ ID No:26)

This PCR product was sequenced and used to design primers for amplifying the 3' end of the human cDNA using a human osteoprotegerin genomic clone in lambda as template:

5'-TCCGTAAGAAACAGCCCAGTGACC-3' (SEQ ID No:27)

5'-CAGATCCTGAAGCTGCTCAGTTTG-3' (SEQ ID No:28)

The amplified PCR product was sequenced, and together with the 5' end sequence, was used to design 5' and 3' human-specific primers useful for amplifying the entire human Osteoprotegerin cDNA coding sequences:

5'-AGCGCGGCCGCGGGGACCACAATGAACAAGTTG-3' (SEQ ID No:29)

5'-AGCTCTAGAATTGTGAGGAAACAGCTCAATGGC-3' (SEQ ID No:30)

The full-length human PCR product was sequenced, then directionally cloned into the plasmid vector pRcCMV (Invitrogen) using Not I and Xba I. The resulting plasmid was named pRcCMV-human Osteoprotegerin. The nucleotide sequence of the cloned product was compared to the rat and mouse Osteoprotegerin cDNA sequences. Over the 1300 bp region spanning the Osteoprotegerin LORF, the rat and mouse DNA sequences are approximately 78-88% identical to the human Osteoprotegerin cDNA. The human Osteoprotegerin cDNA sequence also contained a 401 aa LORF, and it was compared to the rat and mouse protein sequences. The predicted human Osteoprotegerin protein is approximately 85% identical to the rat protein and ˜90% identical to the mouse protein. Sequence alignment of rat, mouse and human proteins shows that they have been highly conserved during evolution. The human protein is predicted to have an N-terminal signal peptide, and 5 potential sites of N-linked glycosylation, 4 of which are conserved between the rat and mouse Osteoprotegerin proteins.

EXAMPLE 6 Production of Recombinant Secreted Osteoprotegerin Protein in Mammalian Cells

To determine if Osteoprotegerin is actually a secreted protein we expressed the mouse cDNA, fused to the human IgG1 Fc domain as a tag (Capon et al. Nature 337, 525-531 (1989)), in human 293 fibroblasts. The cloned mouse cDNA was amplified using the following two sets of primer pairs:

Pair 1

5'-CCTCTGAGCTCAAGCTTCCGAGGACCACAATGAACAAG-3' (SEQ ID No:31)

5'-CCTCTGCGGCCGCTAAGCAGCTTATTTTCACGGATTGAACCTG-3' (SEQ ID No:32)

Pair 2

5'-CCTCTGAGCTCAAGCTTCCGAGGACCACAATGAACAAG-3' (SEQ ID No:33)

5'-CCTCTGCGGCCGCTGTTGCATTTCCTTTCTG-3' (SEQ ID No:34)

The first pair amplifies the entire Osteoprotegerin LORF, and creates a Not I restriction site which is compatible with the in-frame Not I site of the Fc fusion vector FcA3. FcA3 was prepared by engineering a Not I restriction site 5' to aspartic acid residue 216 of the human IgG1 Fc cDNA. This construct introduces a linker which encodes two irrelevant amino acids which span the junction between the Osteoprotegerin protein and the IgG Fc region. This product, when linked to the Fc portion, would encode all 401 Osteoprotegerin residues directly followed by all 227 amino acid residues of the human IgG1 Fc region (Fl.Fc). The second primer pair amplifies the DNA sequences encoding the first 180 amino acid residues of Osteoprotegerin, which encompasses its putative ligand binding domain. As above, the 3' primer creates an artificial Not I restriction site which fuses the C-terminal truncated Osteoprotegerin LORF at position threonine 180 directly to the IgG1 Fc domain (CT.Fc).

Both products were directionally cloned into the plasmid vector pCEP4 (Invitrogen). pCEP4 contains the Epstein-Barr virus origin of replication, and is capable of episomal replication in 293-EBNA-1 cells. The parent pCEP4, and pCEP4-Fl.Fc and pCEP4-CT.Fc vectors were lipofected into 293-EBNA-1 cells using the manufacturer's recommended methods. The transfected cells were then selected in 100 μg/ml hygromycin to select for vector expression, and the resulting drug-resistant mass cultures were grown to confluence. The cells were then cultured in serum-free media for 72 hr, and the conditioned media removed and analysed by SDS-PAGE. Silver staining of the polyacrylamide gel detects the major conditioned media proteins produced by the drug resistant 293 cultures. In the pCEP4-Fl.Fc and the pCEP4-CT.Fc conditioned media, unique bands of the predicted size were abundantly secreted. The full-length Fc fusion protein accumulated to a high concentration, indicating that it may be stable. Both Fc fusion proteins were detected by anti-human IgG1 Fc antibodies (Pierce) on western blots, indicating that they are recombinant Osteoprotegerin products.

The full-length Osteoprotegerin-Fc fusion protein was purified by Protein-A column chromatography (Pierce) using the manufacturer's recommended procedures. The protein was then subjected to N-terminal sequence analysis by automated Edman degradation essentially as described by Matsudaira et al. (J. Biol. Chem. 262, 10-35 (1987)). The following amino acid sequence was read after 19 cycles:

NH₂-E T L P P K Y L H Y D P E T G H Q L L-CO₂H (SEQ ID No:35)

This sequence was identical to the predicted mouse Osteoprotegerin amino acid sequence beginning at amino acid residue 22, suggesting that the natural mammalian leader cleavage site is between amino acid residues Q₂₁-E₂₂, not between Y₃₁-D₃₂ as originally predicted. The expression experiments performed in 293-EBNA cells with pCEP4-Fl.Fc and pCEP4-CT.Fc demonstrate that Osteoprotegerin is a secreted protein, and may act systemically to bind its unidentified ligand.
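The comparison between the Edman read and the predicted precursor can be sketched in a few lines of Python. This is an illustration only: the 40-residue string is transcribed from the translation given for SEQ ID NO:3 in the sequence listing, which is assumed here to be the mouse sequence, and the names PRECURSOR_N_TERM, EDMAN_READ and mature_start are hypothetical.

```python
# First 40 residues of the precursor, one-letter code, transcribed from the
# translation shown for SEQ ID NO:3 (assumed here to be the mouse sequence).
PRECURSOR_N_TERM = "MNKWLCCALLVLLDIIEWTTQETLPPKYLHYDPETGHQLL"

# 19-cycle Edman read of the purified Fc fusion protein (SEQ ID NO:35).
EDMAN_READ = "ETLPPKYLHYDPETGHQLL"


def mature_start(precursor: str, read: str) -> int:
    """Return the 1-based residue number at which `read` begins in `precursor`."""
    index = precursor.find(read)
    if index < 0:
        raise ValueError("Edman read not found in the precursor sequence")
    return index + 1


start = mature_start(PRECURSOR_N_TERM, EDMAN_READ)
last_leader = PRECURSOR_N_TERM[start - 2]   # residue just before the read
first_mature = PRECURSOR_N_TERM[start - 1]  # first residue of the read

print(f"Read begins at residue {start}")
print(f"Implied cleavage site: {last_leader}{start - 1}-{first_mature}{start}")
```

Under these assumptions the read aligns at residue 22, which places the implied cleavage site between Q21 and E22, in agreement with the text.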

EXAMPLE 7 Expression of Human Osteoprotegerin in E. coli

In this example, the expression vector used was pAMG21, a derivative of pCFM1656 (ATCC accession no. 69576) which contains appropriate restriction sites for insertion of genes downstream from the luxPR promoter (see U.S. Pat. No. 5,169,318 for a description of the lux expression system). The host cell used was GM120. This host has the lacI^(Q) promoter and lacI gene integrated into a second site in the host chromosome of a prototrophic E. coli K12 host. Other commonly used E. coli expression vectors and host cells are also suitable for expression.

A DNA sequence coding for an N-terminal methionine and amino acids 32-401 of the human Osteoprotegerin polypeptide was placed under control of the luxPR promoter in the plasmid expression vector pAMG21 as follows. PCR using oligonucleotides #1257-20 and #1257-19 as primers was performed using as a template plasmid pRcCMV-huCr1 containing the human Osteoprotegerin cDNA, thermocycling for 30 cycles with each cycle being: 94° C. for 20 seconds, followed by 37° C. for 30 seconds, followed by 72° C. for 30 seconds. The resulting PCR sample was resolved on an agarose gel, and the PCR product was excised, purified, restricted with KpnI and BamHI restriction endonucleases, and purified again. Synthetic oligonucleotides #1257-21 and #1257-22 were phosphorylated individually using T4 polynucleotide kinase and ATP, and were then mixed together, heated at 94° C. and allowed to slow cool to room temperature to form an oligonucleotide linker duplex containing NdeI and KpnI sticky ends. The phosphorylated linker duplex formed between oligonucleotides #1257-21 and #1257-22 containing NdeI and KpnI cohesive ends (see diagram below) and the KpnI and BamHI digested and purified PCR product generated using oligo primers #1257-20 and #1257-19 (see above) were directionally inserted between two sites of the plasmid vector pAMG21, namely the NdeI site and BamHI site, using standard recombinant DNA methodology (see diagram and sequences below). The synthetic linker utilized E. coli codons and provided for an N-terminal methionine.

Two clones were selected and plasmid DNA isolated, and the human Osteoprotegerin insert was subsequently confirmed by DNA sequencing. The resulting pAMG21 plasmid containing amino acids 32-401 of the human Osteoprotegerin polypeptide immediately preceded in frame by a methionine is hereinafter referred to as pAMG21-hu-Osteoprotegerin-32-401 or pAMG21-huCr1-32-401.

Oligo#1257-19

5'-TACGCACTGGATCCTTATAAGCAGCTTATTTTTACTGATTGGAC-3'(SEQ ID No:36)

Oligo#1257-20

5'-GTCCTCCTGGTACCTACCTAAAACAAC-3' (SEQ ID No:37)

Oligo#1257-21

5'-TATGGATGAAGAAACTTCTCATCAGCTGCTGTGTGATAAATGTCCGCCGGGTAC-3' (SEQ ID No:38)

Oligo#1257-22

5'-CCGGCGGACATTTATCACACAGCAGCTGATGAGAAGTTTCTTCATCCA-3' (SEQ ID No:39)
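As an illustrative check that oligonucleotides #1257-21 and #1257-22 anneal into a duplex with the cohesive ends described above, the following Python sketch computes the unpaired ends of the top strand. The function names are hypothetical and the code is not part of the original method. It relies on the known cleavage patterns of NdeI (CA^TATG, leaving a 5' TA overhang) and KpnI (GGTAC^C, leaving a 3' GTAC overhang).

```python
OLIGO_21 = "TATGGATGAAGAAACTTCTCATCAGCTGCTGTGTGATAAATGTCCGCCGGGTAC"  # SEQ ID NO:38
OLIGO_22 = "CCGGCGGACATTTATCACACAGCAGCTGATGAGAAGTTTCTTCATCCA"        # SEQ ID NO:39

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def overhangs(top, bottom):
    """Return the unpaired (5' end, 3' end) of `top` when annealed to `bottom`.

    Assumes `bottom` pairs with one contiguous stretch of `top`.
    """
    paired = reverse_complement(bottom)
    offset = top.find(paired)
    if offset < 0:
        raise ValueError("strands do not form a contiguous duplex")
    return top[:offset], top[offset + len(paired):]


five_prime, three_prime = overhangs(OLIGO_21, OLIGO_22)
print("5' overhang of top strand:", five_prime)   # expected to match NdeI (TA)
print("3' overhang of top strand:", three_prime)  # expected to match KpnI (GTAC)
```

With the sequences as listed, the top strand carries a two-nucleotide 5' extension (TA) and a four-nucleotide 3' extension (GTAC), consistent with NdeI and KpnI cohesive ends respectively.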

Cultures of pAMG21-hu-Osteoprotegerin-32-401 in E. coli GM120 in 2XYT media containing 20 μg/ml kanamycin were incubated at 30° C. prior to induction. Induction of huCr1-co-DN10 gene product expression from the luxPR promoter was achieved following the addition of the synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the culture media to a final concentration of 30 ng/ml, and cultures were incubated at either 30° C. or 37° C. for a further 6 hours. After 6 hours, the bacterial cultures were examined by microscopy for the presence of inclusion bodies and were then pelleted by centrifugation. Refractile inclusion bodies were observed in induced cultures, indicating that some of the recombinant hu-Osteoprotegerin-32-401 gene product was produced insolubly in E. coli. Some bacterial pellets were resuspended in 10 mM Tris-HCl/pH8, 1 mM EDTA and lysed directly by addition of 2× Laemmli sample buffer to 1× final, and β-mercaptoethanol to 5% final concentration, and analyzed by SDS-PAGE. A substantially more intense Coomassie-stained band of approximately 42 kDa was observed on an SDS-PAGE gel containing total cell lysates of 30° C. and 37° C. induced cultures versus lane 2, which is a total cell lysate of a 30° C. uninduced culture. The expected gene product would be 370 amino acids in length and have an expected molecular weight of about 42.2 kDa. Following induction at 37° C. for 6 hours, an additional culture was pelleted and either processed for isolation of inclusion bodies (see below) or processed by microfluidizing. The pellet processed for microfluidizing was resuspended in 25 mM Tris-HCl/pH8, 0.5M NaCl buffer and passed 20 times through a Microfluidizer Model 1108 (Microfluidics Corp.) and collected. An aliquot of the collected sample was removed (microfluidized total lysate), and the remainder was pelleted at 20,000×g for 20 minutes. The supernatant following centrifugation was removed (microfluidized soluble fraction) and the pellet resuspended in a 25 mM Tris-HCl/pH8, 0.5M NaCl, 6M urea solution (microfluidized insoluble fraction). An aliquot of the total, soluble, or insoluble fraction was added to an equal volume of 2× Laemmli sample buffer and β-mercaptoethanol to 5% final concentration. The samples were then analyzed by SDS-PAGE. A significant amount of recombinant hu-Osteoprotegerin-32-401 gene product appeared to be found in the insoluble fraction.

To purify the recombinant protein, inclusion bodies were purified as follows: Bacterial cells were separated from media by density gradient centrifugation in a Beckman J-6B centrifuge equipped with a JS-4.2 rotor at 4,900×g for 15 minutes at 4° C. The bacterial pellet was resuspended in 5 ml of water and then diluted to a final volume of 10 ml with water. This suspension was transferred to a stainless steel cup cooled in ice and subjected to sonic disruption using a Branson Sonifier equipped with a standard tip (power setting=5, duty cycle=95%, 80 bursts). The sonicated cell suspension was centrifuged in a Beckman Optima TLX ultracentrifuge equipped with a TLA 100.3 rotor at 195,000×g for 5 to 10 minutes at 23° C. The supernatant was discarded and the pellet rinsed with a stream of water from a squirt bottle. The pellets were collected by scraping with a micro spatula and transferred to a glass homogenizer (15 ml capacity). Five ml of Percoll solution (75% liquid Percoll, 0.15 M sodium chloride) was added to the homogenizer and the contents were homogenized until uniformly suspended. The volume was increased to 19.5 ml by the addition of Percoll solution, mixed, and distributed into 3 Beckman Quick-Seal tubes (13×32 mm). Tubes were sealed according to the manufacturer's instructions. The tubes were spun in a Beckman TLA 100.3 rotor at 23° C., 20,000 rpm (21,600×g), 30 minutes. The tubes were examined for the appropriate banding pattern. To recover the refractile bodies, gradient fractions were recovered and pooled, then diluted with water. The inclusion bodies were pelleted by centrifugation, and the protein concentration was estimated following SDS-PAGE.
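The relation between rotor speed and relative centrifugal force quoted for the Percoll gradient step (20,000 rpm, 21,600×g) can be illustrated with the standard formula RCF = 1.118 × 10⁻⁵ × r × rpm², where r is the radius in centimetres. The sketch below is an illustration only. The radius of 4.83 cm is an assumption chosen because it reproduces the stated pair of values; it is not given in the text, and the rotor documentation should be consulted for the actual figure.

```python
def rcf(rpm, radius_cm):
    """Relative centrifugal force (multiples of g) at a given radius in cm."""
    return 1.118e-5 * radius_cm * rpm ** 2


def rpm_for(rcf_target, radius_cm):
    """Rotor speed (rpm) needed to reach `rcf_target` at a given radius in cm."""
    return (rcf_target / (1.118e-5 * radius_cm)) ** 0.5


# ASSUMPTION: effective radius in cm, back-calculated from the stated
# 20,000 rpm / 21,600 x g pair; not taken from the text.
R_MAX_CM = 4.83

print(f"RCF at 20,000 rpm: {rcf(20_000, R_MAX_CM):,.0f} x g")
print(f"Speed for 21,600 x g: {rpm_for(21_600, R_MAX_CM):,.0f} rpm")
```

The same two functions can be used to translate the other g-force values in this example into rotor speeds once the relevant rotor radius is known.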

An aliquot of inclusion bodies isolated as described above was dissolved into 1× Laemmli sample buffer+5% β-mercaptoethanol and resolved on an SDS-PAGE gel; the isolated inclusion bodies provided a highly purified recombinant hu-Osteoprotegerin-32-401 gene product. The major ˜42 kDa band observed after resolving inclusion bodies on an SDS-polyacrylamide gel was excised from a separate gel and the N-terminal amino acid sequence determined essentially as described (Matsudaira et al. J. Biol. Chem. 262, 10-35 (1987)). The following sequence was determined after 19 cycles:

NH2-MYDEETSHQLLCDKCPPGT-COOH (SEQ ID No:40)

This sequence was found to be identical to the first 19 amino acids encoded by the pAMG21-hu-Osteoprotegerin-32-401 expression vector, beginning with a methionine residue provided by the bacterial expression vector.

While the invention has been described in what is considered to be its preferred embodiments, it is not to be limited to the disclosed embodiments, but on the contrary, is intended to cover various modifications and equivalents included within the spirit and scope of the appended claims, which scope is to be accorded the broadest interpretation so as to encompass all such modifications and equivalents.

    __________________________________________________________________________     #             SEQUENCE LISTING                                                 - (1) GENERAL INFORMATION:                                                     -    (iii) NUMBER OF SEQUENCES: 53                                             - (2) INFORMATION FOR SEQ ID NO:1:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 2432 base                                                          (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: CDS                                                              (B) LOCATION: 124..1326                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                  - ATCAAAGGCA GGGCATACTT CCTGTTGCCC AGACCTTATA TAAAACGTCA TG - #TTCGCCTG          60                                                                           - GGCAGCAGAG AAGCACCTAG CACTGGCCCA GCGGCTGCCG CCTGAGGTTT CC - #AGAGGACC         120                                                                           - ACA ATG AAC AAG TGG CTG TGC TGT GCA CTC CT - #G GTG TTC TTG GAC ATC           168                                                                           #Leu Leu Val Phe Leu Asp IleCys Ala                                            #   15                                                                         - ATT GAA TGG ACA ACC CAG GAA ACC TTT CCT CC - #A AAA TAC TTG CAT TAT           216                                                                           Ile Glu Trp Thr Thr 
Gln Glu Thr Phe Pro Pr - #o Lys Tyr Leu His Tyr            #                 30                                                           - GAC CCA GAA ACC GGA CGT CAG CTC TTG TGT GA - #C AAA TGT GCT CCT GGC           264                                                                           Asp Pro Glu Thr Gly Arg Gln Leu Leu Cys As - #p Lys Cys Ala Pro Gly            #             45                                                               - ACC TAC CTA AAA CAG CAC TGC ACA GTC AGG AG - #G AAG ACA CTG TGT GTC           312                                                                           Thr Tyr Leu Lys Gln His Cys Thr Val Arg Ar - #g Lys Thr Leu Cys Val            #         60                                                                   - CCT TGC CCT GAC TAC TCT TAT ACA GAC AGC TG - #G CAC ACG AGT GAT GAA           360                                                                           Pro Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Tr - #p His Thr Ser Asp Glu            #     75                                                                       - TGC GTG TAC TGC AGC CCC GTG TGC AAG GAA CT - #G CAG ACC GTG AAA CAG           408                                                                           Cys Val Tyr Cys Ser Pro Val Cys Lys Glu Le - #u Gln Thr Val Lys Gln            # 95                                                                           - GAG TGC AAC CGC ACC CAC AAC CGA GTG TGC GA - #A TGT GAG GAA GGG CGC           456                                                                           Glu Cys Asn Arg Thr His Asn Arg Val Cys Gl - #u Cys Glu Glu Gly Arg            #               110                                                            - TAC CTG GAG CTC GAA TTC TGC TTG AAG CAC CG - #G AGC TGT CCC CCA GGC           504                                                                           Tyr Leu Glu Leu Glu Phe Cys Leu Lys His Ar - #g Ser Cys Pro Pro Gly            #           125                              
                                  - TTG GGT GTG CTG CAG GCT GGG ACC CCA GAG CG - #A AAC ACG GTT TGC AAA           552                                                                           Leu Gly Val Leu Gln Ala Gly Thr Pro Glu Ar - #g Asn Thr Val Cys Lys            #       140                                                                    - AGA TGT CCG GAT GGG TTC TTC TCA GGT GAG AC - #G TCA TCG AAA GCA CCC           600                                                                           Arg Cys Pro Asp Gly Phe Phe Ser Gly Glu Th - #r Ser Ser Lys Ala Pro            #   155                                                                        - TGT AGG AAA CAC ACC AAC TGC AGC TCA CTT GG - #C CTC CTG CTA ATT CAG           648                                                                           Cys Arg Lys His Thr Asn Cys Ser Ser Leu Gl - #y Leu Leu Leu Ile Gln            160                 1 - #65                 1 - #70                 1 -        #75                                                                            - AAA GGA AAT GCA ACA CAT GAC AAT GTA TGT TC - #C GGA AAC AGA GAA GCA           696                                                                           Lys Gly Asn Ala Thr His Asp Asn Val Cys Se - #r Gly Asn Arg Glu Ala            #               190                                                            - ACT CAA AAT TGT GGA ATA GAT GTC ACC CTG TG - #C GAA GAG GCA TTC TTC           744                                                                           Thr Gln Asn Cys Gly Ile Asp Val Thr Leu Cy - #s Glu Glu Ala Phe Phe            #           205                                                                - AGG TTT GCT GTG CCT ACC AAG ATT ATA CCG AA - #T TGG CTG AGT GTT CTG           792                                                                           Arg Phe Ala Val Pro Thr Lys Ile Ile Pro As - #n Trp Leu Ser Val Leu            #       220                                                           
         - GTG GAC AGT TTG CCT GGG ACC AAA GTG AAT GC - #A GAG AGT GTA GAG AGG           840                                                                           Val Asp Ser Leu Pro Gly Thr Lys Val Asn Al - #a Glu Ser Val Glu Arg            #   235                                                                        - ATA AAA CGG AGA CAC AGC TCG CAA GAG CAA AC - #T TTC CAG CTA CTT AAG           888                                                                           Ile Lys Arg Arg His Ser Ser Gln Glu Gln Th - #r Phe Gln Leu Leu Lys            240                 2 - #45                 2 - #50                 2 -        #55                                                                            - CTG TGG AAG CAT CAA AAC AGA GAC CAG GAA AT - #G GTG AAG AAG ATC ATC           936                                                                           Leu Trp Lys His Gln Asn Arg Asp Gln Glu Me - #t Val Lys Lys Ile Ile            #               270                                                            - CAA GAC ATT GAC CTC TGT GAA AGC AGT GTG CA - #A CGG CAT ATC GGC CAC           984                                                                           Gln Asp Ile Asp Leu Cys Glu Ser Ser Val Gl - #n Arg His Ile Gly His            #           285                                                                - GCG AAC CTC ACC ACA GAG CAG CTC CGC ATC TT - #G ATG GAG AGC TTG CCT          1032                                                                           Ala Asn Leu Thr Thr Glu Gln Leu Arg Ile Le - #u Met Glu Ser Leu Pro            #       300                                                                    - GGG AAG AAG ATC AGC CCA GAC GAG ATT GAG AG - #A ACG AGA AAG ACC TGC          1080                                                                           Gly Lys Lys Ile Ser Pro Asp Glu Ile Glu Ar - #g Thr Arg Lys Thr Cys            #   315                                                                        - AAA CCC AGC 
GAG CAG CTC CTG AAG CTA CTG AG - #C TTG TGG AGG ATC AAA          1128                                                                           Lys Pro Ser Glu Gln Leu Leu Lys Leu Leu Se - #r Leu Trp Arg Ile Lys            320                 3 - #25                 3 - #30                 3 -        #35                                                                            - AAT GGA GAC CAA GAC ACC TTG AAG GGC CTG AT - #G TAC GCA CTC AAG CAC          1176                                                                           Asn Gly Asp Gln Asp Thr Leu Lys Gly Leu Me - #t Tyr Ala Leu Lys His            #               350                                                            - TTG AAA GCA TAC CAC TTT CCC AAA ACC GTC AC - #C CAC AGT CTG AGG AAG          1224                                                                           Leu Lys Ala Tyr His Phe Pro Lys Thr Val Th - #r His Ser Leu Arg Lys            #           365                                                                - ACC ATC AGG TTC TTG CAC AGC TTC ACC ATG TA - #C CGA TTG TAT CAG AAA          1272                                                                           Thr Ile Arg Phe Leu His Ser Phe Thr Met Ty - #r Arg Leu Tyr Gln Lys            #       380                                                                    - CTC TTT CTA GAA ATG ATA GGG AAT CAG GTT CA - #A TCA GTG AAG ATA AGC          1320                                                                           Leu Phe Leu Glu Met Ile Gly Asn Gln Val Gl - #n Ser Val Lys Ile Ser            #   395                                                                        - TGC TTA TAGTTAGGAA TGGTCACTGG GCTGTTTCTT CAGGATGGGC CA - #ACACTGAT           1376                                                                           Cys Leu                                                                        400                                                                            - GGAGCAGATG GCTGCTTCTC CGGCTCTTGA 
AATGGCAGTT GATTCCTTTC TC - #ATCAGTTG        1436                                                                           - GTGGGAATGA AGATCCTCCA GCCCAACACA CACACTGGGG AGTCTGAGTC AG - #GAGAGTGA        1496                                                                           - GGCAGGCTAT TTGATAATTG TGCAAAGCTG CCAGGTGTAC ACCTAGAAAG TC - #AAGCACCC        1556                                                                           - TGAGAAAGAG GATATTTTTA TAACCTCAAA CATAGGCCCT TTCCTTCCTC TC - #CTTATGGA        1616                                                                           - TGAGTACTCA GAAGGCTTCT ACTATCTTCT GTGTCATCCC TAGATGAAGG CC - #TCTTTTAT        1676                                                                           - TTATTTTTTT ATTCTTTTTT TCGGAGCTGG GGACCGAACC CAGGGCCTTG CG - #CTTGCGAG        1736                                                                           - GCAAGTGCTC TACCACTGAG CTAAATCTCC AACCCCTGAA GGCCTCTTTC TT - #TCTGCCTC        1796                                                                           - TGATAGTCTA TGACATTCTT TTTTCTACAA TTCGTATCAG GTGCACGAGC CT - #TATCCCAT        1856                                                                           - TTGTAGGTTT CTAGGCAAGT TGACCGTTAG CTATTTTTCC CTCTGAAGAT TT - #GATTCGAG        1916                                                                           - TTGCAGACTT GGCTAGACAA GCAGGGGTAG GTTATGGTAG TTTATTTAAC AG - #ACTGCCAC        1976                                                                           - CAGGAGTCCA GTGTTTCTTG TTCCTCTGTA GTTGTACCTA AGCTGACTCC AA - #GTACATTT        2036                                                                           - AGTATGAAAA ATAATCAACA AATTTTATTC CTTCTATCAA CATTGGCTAG CT - #TTGTTTCA        2096                                                                           - GGGCACTAAA AGAAACTACT ATATGGAGAA AGAATTGATA TTGCCCCCAA CG - #TTCAACAA        2156                                                        
                   - CCCAATAGTT TATCCAGCTG TCATGCCTGG TTCAGTGTCT ACTGACTATG CG - #CCCTCTTA        2216                                                                           - TTACTGCATG CAGTAATTCA ACTGGAAATA GTAATAATAA TAATAGAAAT AA - #AATCTAGA        2276                                                                           - CTCCATTGGA TCTCTCTGAA TATGGGAATA TCTAACTTAA GAAGCTTTGA GA - #TTTCAGTT        2336                                                                           - GTGTTAAAGG CTTTTATTAA AAAGCTGATG CTCTTCTGTA AAAGTTACTA AT - #ATATCTGT        2396                                                                           #     2432         TGCT ATTTATATCC ATCCAG                                      - (2) INFORMATION FOR SEQ ID NO:2:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 401 amino                                                          (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                  - Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Va - #l Phe Leu Asp Ile Ile          #                 15                                                           - Glu Trp Thr Thr Gln Glu Thr Phe Pro Pro Ly - #s Tyr Leu His Tyr Asp          #             30                                                               - Pro Glu Thr Gly Arg Gln Leu Leu Cys Asp Ly - #s Cys Ala Pro Gly Thr          #         45                                                                   - Tyr Leu Lys Gln His Cys Thr Val Arg Arg Ly - #s Thr Leu Cys Val Pro          #     60                                                                       - Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp Hi - #s Thr Ser Asp Glu Cys          # 80  
                                                                         - Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gl - #n Thr Val Lys Gln Glu          #                 95                                                           - Cys Asn Arg Thr His Asn Arg Val Cys Glu Cy - #s Glu Glu Gly Arg Tyr          #           110                                                                - Leu Glu Leu Glu Phe Cys Leu Lys His Arg Se - #r Cys Pro Pro Gly Leu          #       125                                                                    - Gly Val Leu Gln Ala Gly Thr Pro Glu Arg As - #n Thr Val Cys Lys Arg          #   140                                                                        - Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Se - #r Ser Lys Ala Pro Cys          145                 1 - #50                 1 - #55                 1 -        #60                                                                            - Arg Lys His Thr Asn Cys Ser Ser Leu Gly Le - #u Leu Leu Ile Gln Lys          #               175                                                            - Gly Asn Ala Thr His Asp Asn Val Cys Ser Gl - #y Asn Arg Glu Ala Thr          #           190                                                                - Gln Asn Cys Gly Ile Asp Val Thr Leu Cys Gl - #u Glu Ala Phe Phe Arg          #       205                                                                    - Phe Ala Val Pro Thr Lys Ile Ile Pro Asn Tr - #p Leu Ser Val Leu Val          #   220                                                                        - Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Gl - #u Ser Val Glu Arg Ile          225                 2 - #30                 2 - #35                 2 -        #40                                                                            - Lys Arg Arg His Ser Ser Gln Glu Gln Thr Ph - #e Gln Leu Leu Lys Leu          #               255                                                            - Trp Lys His Gln Asn Arg Asp 
Gln Glu Met Va - #l Lys Lys Ile Ile Gln          #           270                                                                - Asp Ile Asp Leu Cys Glu Ser Ser Val Gln Ar - #g His Ile Gly His Ala          #       285                                                                    - Asn Leu Thr Thr Glu Gln Leu Arg Ile Leu Me - #t Glu Ser Leu Pro Gly          #   300                                                                        - Lys Lys Ile Ser Pro Asp Glu Ile Glu Arg Th - #r Arg Lys Thr Cys Lys          305                 3 - #10                 3 - #15                 3 -        #20                                                                            - Pro Ser Glu Gln Leu Leu Lys Leu Leu Ser Le - #u Trp Arg Ile Lys Asn          #               335                                                            - Gly Asp Gln Asp Thr Leu Lys Gly Leu Met Ty - #r Ala Leu Lys His Leu          #           350                                                                - Lys Ala Tyr His Phe Pro Lys Thr Val Thr Hi - #s Ser Leu Arg Lys Thr          #       365                                                                    - Ile Arg Phe Leu His Ser Phe Thr Met Tyr Ar - #g Leu Tyr Gln Lys Leu          #   380                                                                        - Phe Leu Glu Met Ile Gly Asn Gln Val Gln Se - #r Val Lys Ile Ser Cys          385                 3 - #90                 3 - #95                 4 -        #00                                                                            - Leu                                                                          - (2) INFORMATION FOR SEQ ID NO:3:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 1324 base                                                          (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                     
                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: CDS                                                              (B) LOCATION: 90..1292                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                  - CCTTATATAA ACGTCATGAT TGCCTGGGCT GCAGAGACGC ACCTAGCACT GA - #CCCAGCGG          60                                                                           - CTGCCTCCTG AGGTTTCCCG AGGACCACA ATG AAC AAG TGG CTG - # TGC TGC GCA           113                                                                           #              Met Asn L - #ys Trp Leu Cys Cys Ala                             #             5  1                                                             - CTC CTG GTG CTC CTG GAC ATC ATT GAA TGG AC - #A ACC CAG GAA ACC CTT           161                                                                           Leu Leu Val Leu Leu Asp Ile Ile Glu Trp Th - #r Thr Gln Glu Thr Leu            #     20                                                                       - CCT CCA AAG TAC TTG CAT TAT GAC CCA GAA AC - #T GGT CAT CAG CTC CTG           209                                                                           Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Th - #r Gly His Gln Leu Leu            # 40                                                                           - TGT GAC AAA TGT GCT CCT GGC ACC TAC CTA AA - #A CAG CAC TGC ACA GTG           257                                                                           Cys Asp Lys Cys Ala Pro Gly Thr Tyr Leu Ly - #s Gln His Cys Thr Val            #                 55                                                           - AGG AGG AAG ACA TTG TGT GTC CCT TGC CCT GA - #C CAC TCT TAT ACG GAC           
305                                                                           Arg Arg Lys Thr Leu Cys Val Pro Cys Pro As - #p His Ser Tyr Thr Asp            #             70                                                               - AGC TGG CAC ACC AGT GAT GAG TGT GTG TAT TG - #C AGC CCA GTG TGC AAG           353                                                                           Ser Trp His Thr Ser Asp Glu Cys Val Tyr Cy - #s Ser Pro Val Cys Lys            #         85                                                                   - GAA CTG CAG TCC GTG AAG CAG GAG TGC AAC CG - #C ACC CAC AAC CGA GTG           401                                                                           Glu Leu Gln Ser Val Lys Gln Glu Cys Asn Ar - #g Thr His Asn Arg Val            #    100                                                                       - TGT GAG TGT GAG GAA GGG CGT TAC CTG GAG AT - #C GAA TTC TGC TTG AAG           449                                                                           Cys Glu Cys Glu Glu Gly Arg Tyr Leu Glu Il - #e Glu Phe Cys Leu Lys            105                 1 - #10                 1 - #15                 1 -        #20                                                                            - CAC CGG AGC TGT CCC CCG GGC TCC GGC GTG GT - #G CAA GCT GGA ACC CCA           497                                                                           His Arg Ser Cys Pro Pro Gly Ser Gly Val Va - #l Gln Ala Gly Thr Pro            #               135                                                            - GAG CGA AAC ACA GTT TGC AAA AAA TGT CCA GA - #T GGG TTC TTC TCA GGT           545                                                                           Glu Arg Asn Thr Val Cys Lys Lys Cys Pro As - #p Gly Phe Phe Ser Gly            #           150                                                                - GAG ACT TCA TCG AAA GCA CCC TGT ATA AAA CA - #C ACG AAC TGC AGC ACA           593                      
Glu Thr Ser Ser Lys Ala Pro Cys Ile Lys His Thr Asn Cys Ser Thr
                                                165

TTT GGC CTC CTG CTA ATT CAG AAA GGA AAT GCA ACA CAT GAC AAC GTG  641
Phe Gly Leu Leu Leu Ile Gln Lys Gly Asn Ala Thr His Asp Asn Val
                                            180

TGT TCC GGA AAC AGA GAA GCC ACG CAA AAG TGT GGA ATA GAT GTC ACC  689
Cys Ser Gly Asn Arg Glu Ala Thr Gln Lys Cys Gly Ile Asp Val Thr
185                 190                 195                 200

CTG TGT GAA GAG GCC TTC TTC AGG TTT GCT GTT CCT ACC AAG ATT ATA  737
Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr Lys Ile Ile
                                                        215

CCA AAT TGG CTG AGT GTT TTG GTG GAC AGT TTG CCT GGG ACC AAA GTG  785
Pro Asn Trp Leu Ser Val Leu Val Asp Ser Leu Pro Gly Thr Lys Val
                                                    230

AAT GCC GAG AGT GTA GAG AGG ATA AAA CGG AGA CAC AGC TCA CAA GAG  833
Asn Ala Glu Ser Val Glu Arg Ile Lys Arg Arg His Ser Ser Gln Glu
                                                245

CAA ACC TTC CAG CTG CTG AAG CTG TGG AAA CAT CAA AAC AGA GAC CAG  881
Gln Thr Phe Gln Leu Leu Lys Leu Trp Lys His Gln Asn Arg Asp Gln
                                            260

GAA ATG GTG AAG AAG ATC ATC CAA GAC ATT GAC CTC TGT GAA AGC AGC  929
Glu Met Val Lys Lys Ile Ile Gln Asp Ile Asp Leu Cys Glu Ser Ser
265                 270                 275                 280

GTG CAG CGG CAT CTC GGC CAC TCG AAC CTC ACC ACA GAG CAG CTT CTT  977
Val Gln Arg His Leu Gly His Ser Asn Leu Thr Thr Glu Gln Leu Leu
                                                        295

GCC TTG ATG GAG AGC CTG CCT GGG AAG AAG ATC AGC CCA GAA GAG ATT  1025
Ala Leu Met Glu Ser Leu Pro Gly Lys Lys Ile Ser Pro Glu Glu Ile
                                                    310

GAG AGA ACG AGA AAG ACC TGC AAA TCG AGC GAG CAG CTC CTG AAG CTA  1073
Glu Arg Thr Arg Lys Thr Cys Lys Ser Ser Glu Gln Leu Leu Lys Leu
                                                325

CTC AGT TTA TGG AGG ATC AAA AAT GGT GAC CAA GAC ACC TTG AAG GGC  1121
Leu Ser Leu Trp Arg Ile Lys Asn Gly Asp Gln Asp Thr Leu Lys Gly
                                            340

CTG ATG TAT GCC CTC AAG CAC TTG AAA ACA TCC CAC TTT CCC AAA ACT  1169
Leu Met Tyr Ala Leu Lys His Leu Lys Thr Ser His Phe Pro Lys Thr
345                 350                 355                 360

GTC ACC CAC AGT CTG AGG AAG ACC ATG AGG TTC CTG CAC AGC TTC ACA  1217
Val Thr His Ser Leu Arg Lys Thr Met Arg Phe Leu His Ser Phe Thr
                                                        375

ATG TAC AGA CTG TAT CAG AAG CTC TTT TTA GAA ATG ATA GGG AAT CAG  1265
Met Tyr Arg Leu Tyr Gln Lys Leu Phe Leu Glu Met Ile Gly Asn Gln
                                                    390

GTT CAA TCC GTG AAA ATA AGC TGC TTA TAACTAGGAA TGGTCACTGG  1312
Val Gln Ser Val Lys Ile Ser Cys Leu
                            400

                   1324

(2) INFORMATION FOR SEQ ID NO:4:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 401 amino acids
          (B) TYPE: amino acid
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Leu Leu Asp Ile Ile
                                                         15

Glu Trp Thr Thr Gln Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp
                                                     30

Pro Glu Thr Gly His Gln Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr
                                                 45

Tyr Leu Lys Gln His Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro
                                             60

Cys Pro Asp His Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys
                                                             80

Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gln Ser Val Lys Gln Glu
                                                         95

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr
                                                    110

Leu Glu Ile Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Ser
                                                125

Gly Val Val Gln Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Lys
                                            140

Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys
145                 150                 155                 160

Ile Lys His Thr Asn Cys Ser Thr Phe Gly Leu Leu Leu Ile Gln Lys
                                                        175

Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr
                                                    190

Gln Lys Cys Gly Ile Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg
                                                205

Phe Ala Val Pro Thr Lys Ile Ile Pro Asn Trp Leu Ser Val Leu Val
                                            220

Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg Ile
225                 230                 235                 240

Lys Arg Arg His Ser Ser Gln Glu Gln Thr Phe Gln Leu Leu Lys Leu
                                                        255

Trp Lys His Gln Asn Arg Asp Gln Glu Met Val Lys Lys Ile Ile Gln
                                                    270

Asp Ile Asp Leu Cys Glu Ser Ser Val Gln Arg His Leu Gly His Ser
                                                285

Asn Leu Thr Thr Glu Gln Leu Leu Ala Leu Met Glu Ser Leu Pro Gly
                                            300

Lys Lys Ile Ser Pro Glu Glu Ile Glu Arg Thr Arg Lys Thr Cys Lys
305                 310                 315                 320

Ser Ser Glu Gln Leu Leu Lys Leu Leu Ser Leu Trp Arg Ile Lys Asn
                                                        335

Gly Asp Gln Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu
                                                    350

Lys Thr Ser His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr
                                                365

Met Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gln Lys Leu
                                            380

Phe Leu Glu Met Ile Gly Asn Gln Val Gln Ser Val Lys Ile Ser Cys
385                 390                 395                 400

Leu

(2) INFORMATION FOR SEQ ID NO:5:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 1355 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (ix) FEATURE:
          (A) NAME/KEY: CDS
          (B) LOCATION: 94..1296
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

GTATATATAA CGTGATGAGC GTACGGGTGC GGAGACGCAC CGGAGCGCTC GCCCAGCCGC  60

CGCTCCAAGC CCCTGAGGTT TCCGGGGACC ACA ATG AAC AAG TTG CTG TGC TGC  114
                                     Met Asn Lys Leu Leu Cys Cys
                                       1               5

GCG CTC GTG TTT CTG GAC ATC TCC ATT AAG TGG ACC ACC CAG GAA ACG  162
Ala Leu Val Phe Leu Asp Ile Ser Ile Lys Trp Thr Thr Gln Glu Thr
                                                 20

TTT CCT CCA AAG TAC CTT CAT TAT GAC GAA GAA ACC TCT CAT CAG CTG  210
Phe Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His Gln Leu
                                             35

TTG TGT GAC AAA TGT CCT CCT GGT ACC TAC CTA AAA CAA CAC TGT ACA  258
Leu Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gln His Cys Thr
                                                             55

GCA AAG TGG AAG ACC GTG TGC GCC CCT TGC CCT GAC CAC TAC TAC ACA  306
Ala Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr
                                                         70

GAC AGC TGG CAC ACC AGT GAC GAG TGT CTA TAC TGC AGC CCC GTG TGC  354
Asp Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro Val Cys
                                                     85

AAG GAG CTG CAG TAC GTC AAG CAG GAG TGC AAT CGC ACC CAC AAC CGC  402
Lys Glu Leu Gln Tyr Val Lys Gln Glu Cys Asn Arg Thr His Asn Arg
                                                100

GTG TGC GAA TGC AAG GAA GGG CGC TAC CTT GAG ATA GAG TTC TGC TTG  450
Val Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu Ile Glu Phe Cys Leu
                                            115

AAA CAT AGG AGC TGC CCT CCT GGA TTT GGA GTG GTG CAA GCT GGA ACC  498
Lys His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gln Ala Gly Thr
120                 125                 130                 135

CCA GAG CGA AAT ACA GTT TGC AAA AGA TGT CCA GAT GGG TTC TTC TCA  546
Pro Glu Arg Asn Thr Val Cys Lys Arg Cys Pro Asp Gly Phe Phe Ser
                                                        150

AAT GAG ACG TCA TCT AAA GCA CCC TGT AGA AAA CAC ACA AAT TGC AGT  594
Asn Glu Thr Ser Ser Lys Ala Pro Cys Arg Lys His Thr Asn Cys Ser
                                                    165

GTC TTT GGT CTC CTG CTA ACT CAG AAA GGA AAT GCA ACA CAC GAC AAC  642
Val Phe Gly Leu Leu Leu Thr Gln Lys Gly Asn Ala Thr His Asp Asn
                                                180

ATA TGT TCC GGA AAC AGT GAA TCA ACT CAA AAA TGT GGA ATA GAT GTT  690
Ile Cys Ser Gly Asn Ser Glu Ser Thr Gln Lys Cys Gly Ile Asp Val
                                            195

ACC CTG TGT GAG GAG GCA TTC TTC AGG TTT GCT GTT CCT ACA AAG TTT  738
Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr Lys Phe
200                 205                 210                 215

ACG CCT AAC TGG CTT AGT GTC TTG GTA GAC AAT TTG CCT GGC ACC AAA  786
Thr Pro Asn Trp Leu Ser Val Leu Val Asp Asn Leu Pro Gly Thr Lys
                                                        230

GTA AAC GCA GAG AGT GTA GAG AGG ATA AAA CGG CAA CAC AGC TCA CAA  834
Val Asn Ala Glu Ser Val Glu Arg Ile Lys Arg Gln His Ser Ser Gln
                                                    245

GAA CAG ACT TTC CAG CTG CTG AAG TTA TGG AAA CAT CAA AAC AAA GCC  882
Glu Gln Thr Phe Gln Leu Leu Lys Leu Trp Lys His Gln Asn Lys Ala
                                                260

CAA GAT ATA GTC AAG AAG ATC ATC CAA GAT ATT GAC CTC TGT GAA AAC  930
Gln Asp Ile Val Lys Lys Ile Ile Gln Asp Ile Asp Leu Cys Glu Asn
                                            275

AGC GTG CAG CGG CAC ATT GGA CAT GCT AAC CTC ACC TTC GAG CAG CTT  978
Ser Val Gln Arg His Ile Gly His Ala Asn Leu Thr Phe Glu Gln Leu
280                 285                 290                 295

CGT AGC TTG ATG GAA AGC TTA CCG GGA AAG AAA GTG GGA GCA GAA GAC  1026
Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val Gly Ala Glu Asp
                                                        310

ATT GAA AAA ACA ATA AAG GCA TGC AAA CCC AGT GAC CAG ATC CTG AAG  1074
Ile Glu Lys Thr Ile Lys Ala Cys Lys Pro Ser Asp Gln Ile Leu Lys
                                                    325

CTG CTC AGT TTG TGG CGA ATA AAA AAT GGC GAC CAA GAC ACC TTG AAG  1122
Leu Leu Ser Leu Trp Arg Ile Lys Asn Gly Asp Gln Asp Thr Leu Lys
                                                340

GGC CTA ATG CAC GCA CTA AAG CAC TCA AAG ACG TAC CAC TTT CCC AAA  1170
Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe Pro Lys
                                            355

ACT GTC ACT CAG AGT CTA AAG AAG ACC ATC AGG TTC CTT CAC AGC TTC  1218
Thr Val Thr Gln Ser Leu Lys Lys Thr Ile Arg Phe Leu His Ser Phe
360                 365                 370                 375

ACA ATG TAC AAA TTG TAT CAG AAG TTA TTT TTA GAA ATG ATA GGT AAC  1266
Thr Met Tyr Lys Leu Tyr Gln Lys Leu Phe Leu Glu Met Ile Gly Asn
                                                        390

CAG GTC CAA TCA GTA AAA ATA AGC TGC TTA TAACTGGAAA TGGCCATTGA  1316
Gln Val Gln Ser Val Lys Ile Ser Cys Leu
                                400

                 TGGC GAGATCCCAT GGATGATAA  1355

(2) INFORMATION FOR SEQ ID NO:6:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 401 amino acids
                                                      (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                  - Met Asn Lys Leu Leu Cys Cys Ala Leu Val Ph - #e Leu Asp Ile Ser Ile          #                 15                                                           - Lys Trp Thr Thr Gln Glu Thr Phe Pro Pro Ly - #s Tyr Leu His Tyr Asp          #             30                                                               - Glu Glu Thr Ser His Gln Leu Leu Cys Asp Ly - #s Cys Pro Pro Gly Thr          #         45                                                                   - Tyr Leu Lys Gln His Cys Thr Ala Lys Trp Ly - #s Thr Val Cys Ala Pro          #     60                                                                       - Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp Hi - #s Thr Ser Asp Glu Cys          # 80                                                                           - Leu Tyr Cys Ser Pro Val Cys Lys Glu Leu Gl - #n Tyr Val Lys Gln Glu          #                 95                                                           - Cys Asn Arg Thr His Asn Arg Val Cys Glu Cy - #s Lys Glu Gly Arg Tyr          #           110                                                                - Leu Glu Ile Glu Phe Cys Leu Lys His Arg Se - #r Cys Pro Pro Gly Phe          #       125                                                                    - Gly Val Val Gln Ala Gly Thr Pro Glu Arg As - #n Thr Val Cys Lys Arg          #   140                                                                        - Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Se - #r Ser Lys Ala Pro Cys          145                 1 - #50                 1 - #55                 1 -        #60                                                         
                   - Arg Lys His Thr Asn Cys Ser Val Phe Gly Le - #u Leu Leu Thr Gln Lys          #               175                                                            - Gly Asn Ala Thr His Asp Asn Ile Cys Ser Gl - #y Asn Ser Glu Ser Thr          #           190                                                                - Gln Lys Cys Gly Ile Asp Val Thr Leu Cys Gl - #u Glu Ala Phe Phe Arg          #       205                                                                    - Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Tr - #p Leu Ser Val Leu Val          #   220                                                                        - Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Gl - #u Ser Val Glu Arg Ile          225                 2 - #30                 2 - #35                 2 -        #40                                                                            - Lys Arg Gln His Ser Ser Gln Glu Gln Thr Ph - #e Gln Leu Leu Lys Leu          #               255                                                            - Trp Lys His Gln Asn Lys Ala Gln Asp Ile Va - #l Lys Lys Ile Ile Gln          #           270                                                                - Asp Ile Asp Leu Cys Glu Asn Ser Val Gln Ar - #g His Ile Gly His Ala          #       285                                                                    - Asn Leu Thr Phe Glu Gln Leu Arg Ser Leu Me - #t Glu Ser Leu Pro Gly          #   300                                                                        - Lys Lys Val Gly Ala Glu Asp Ile Glu Lys Th - #r Ile Lys Ala Cys Lys          305                 3 - #10                 3 - #15                 3 -        #20                                                                            - Pro Ser Asp Gln Ile Leu Lys Leu Leu Ser Le - #u Trp Arg Ile Lys Asn          #               335                                                            - Gly Asp Gln Asp Thr Leu Lys Gly Leu Met Hi - #s Ala Leu Lys His Ser          #     
      350                                                                - Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gl - #n Ser Leu Lys Lys Thr          #       365                                                                    - Ile Arg Phe Leu His Ser Phe Thr Met Tyr Ly - #s Leu Tyr Gln Lys Leu          #   380                                                                        - Phe Leu Glu Met Ile Gly Asn Gln Val Gln Se - #r Val Lys Ile Ser Cys          385                 3 - #90                 3 - #95                 4 -        #00                                                                            - Leu                                                                          - (2) INFORMATION FOR SEQ ID NO:7:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 36 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                  #       36         CGGC CGCTACANNN NNNNNT                                      - (2) INFORMATION FOR SEQ ID NO:8:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 16 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO:8:                                  #    16                                                                        - (2) INFORMATION FOR SEQ ID NO:9:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 12 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                  #       12                                                                     - (2) INFORMATION FOR SEQ ID NO:10:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 18 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                 #  18              GT                                                          - (2) INFORMATION FOR SEQ ID NO:11:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 18 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear               
                                  -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                 #  18              CC                                                          - (2) INFORMATION FOR SEQ ID NO:12:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 20 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                 # 20               AAGG                                                        - (2) INFORMATION FOR SEQ ID NO:13:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 23 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                 #                23ACCG GAC                                                    - (2) INFORMATION FOR SEQ ID NO:14:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 23 base                                                            (B) TYPE: nucleic acid                                      
                   (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                 #                23CACA TTC                                                    - (2) INFORMATION FOR SEQ ID NO:15:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 30 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                 #           30     AACA AGTGGCTGTG                                             - (2) INFORMATION FOR SEQ ID NO:16:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 45 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                 #45                TAAA CTATGAAACA GCCCAGTGAC CATTC                            - (2) INFORMATION FOR SEQ ID NO:17:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) 
LENGTH: 21 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                 #21                GGGA C                                                      - (2) INFORMATION FOR SEQ ID NO:18:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 21 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                 #21                TGAG C                                                      - (2) INFORMATION FOR SEQ ID NO:19:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 24 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                 #                24ACTT CCTG                                                   - (2) INFORMATION FOR SEQ ID NO:20:    
                                        -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 24 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                 #                24ACGG TCTG                                                   - (2) INFORMATION FOR SEQ ID NO:21:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 24 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                 #                24GCCT GATG                                                   - (2) INFORMATION FOR SEQ ID NO:22:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 24 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                  
#                24GCAT CAGC

(2) INFORMATION FOR SEQ ID NO:23:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 33 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
#         33       CAAG TGGCTGTGCT GCG

(2) INFORMATION FOR SEQ ID NO:24:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 31 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
#          31      CCCA GTGACCATTC C

(2) INFORMATION FOR SEQ ID NO:25:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 24 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:
#                24ACCT GATG

(2) INFORMATION FOR SEQ ID NO:26:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 24 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
#                24ACTT CCTG

(2) INFORMATION FOR SEQ ID NO:27:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 24 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
#                24CAGT GACC

(2) INFORMATION FOR SEQ ID NO:28:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 24 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:
#                24TCAG TTTG

(2) INFORMATION FOR SEQ ID NO:29:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 33 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:
#         33       CCAC AATGAACAAG TTG

(2) INFORMATION FOR SEQ ID NO:30:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 33 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:
#         33       GGAA ACAGCTCAAT GGC

(2) INFORMATION FOR SEQ ID NO:31:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 38 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:
#     38           TCCG AGGACCACAA TGAACAAG

(2) INFORMATION FOR SEQ ID NO:32:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 43 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:
# 43               GCAG CTTATTTTCA CGGATTGAAC CTG

(2) INFORMATION FOR SEQ ID NO:33:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 38 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:
#     38           TCCG AGGACCACAA TGAACAAG

(2) INFORMATION FOR SEQ ID NO:34:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 31 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
#          31      TGCA TTTCCTTTCT G

(2) INFORMATION FOR SEQ ID NO:35:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 19 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:
Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly His
                                                        15
Gln Leu Leu

(2) INFORMATION FOR SEQ ID NO:36:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 44 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:
# 44               ATAA GCAGCTTATT TTTACTGATT GGAC

(2) INFORMATION FOR SEQ ID NO:37:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 27 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:
#             27   CCTA AAACAAC

(2) INFORMATION FOR SEQ ID NO:38:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 54 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:
TATGGATGAA GAAACTTCTC ATCAGCTGCT GTGTGATAAA TGTCCGCCGG GTAC      54

(2) INFORMATION FOR SEQ ID NO:39:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 48 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:
#                48ACAC AGCAGCTGAT GAGAAGTTTC TTCATCCA

(2) INFORMATION FOR SEQ ID NO:40:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 19 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:
Met Tyr Asp Glu Glu Thr Ser His Gln Leu Leu Cys Asp Lys Cys Pro
                                                        15
Pro Gly Thr

(2) INFORMATION FOR SEQ ID NO:41:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 70 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:
Ala Leu Leu Val Phe Leu Asp Ile Ile Glu Trp Thr Thr Gln Glu Thr
                                                        15
Phe Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly Arg Gln Leu
                                                    30
Leu Cys Asp Lys Cys Ala Pro Gly Thr Tyr Leu Lys Gln His Cys Thr
                                                45
Val Arg Arg Lys Thr Leu Cys Val Pro Cys Pro Asp Tyr Ser Tyr Thr
                                            60
Asp Ser Trp His Thr Ser
                    70

(2) INFORMATION FOR SEQ ID NO:42:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 120 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:
His Ala Leu Pro Ala Gln Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro
                                                        15
Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gln Thr Ala Gln Met
                                                    30
Cys Cys Ser Lys Cys Ser Pro Gly Gln His Ala Lys Val Phe Cys Thr
                                                45
Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr
                                            60
Gln Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys
                                                            80
Ser Ser Asp Gln Val Glu Thr Gln Ala Cys Thr Arg Glu Gln Asn Arg
                                                        95
Ile Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gln Glu
                                                    110
Gly Cys Arg Leu Cys Ala Pro Leu
                            120

(2) INFORMATION FOR SEQ ID NO:43:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 48 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:
Tyr Leu His Tyr Asp Pro Glu Thr Gly Arg Gln Leu Leu Cys Asp Lys
                                                        15
Cys Ala Pro Gly Thr Tyr Leu Lys Gln His Cys Thr Val Arg Arg Lys
                                                    30
Thr Leu Cys Val Pro Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp His
                                                45

(2) INFORMATION FOR SEQ ID NO:44:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 51 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:
Tyr His Tyr Tyr Asp Gln Asn Gly Arg Met Cys Glu Glu Cys His Met
                                                        15
Cys Gln Pro Gly His Phe Leu Val Lys His Cys Lys Gln Pro Lys Arg
                                                    30
Asp Thr Val Cys His Lys Pro Cys Glu Pro Gly Val Thr Tyr Thr Asp
                                                45
Asp Trp His
    50

(2) INFORMATION FOR SEQ ID NO:45:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 219 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:
Met Leu Gly Ile Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala
                                                        15
Arg Leu Ser Ser Lys Ser Val Asn Ala Gln Val Thr Asp Ile Asn Ser
                                                    30
Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gln Asn
                                                45
Leu Glu Gly Leu His His Asp Gly Gln Phe Cys His Lys Pro Cys Pro
                                            60
Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro
                                                            80
Asp Cys Val Pro Cys Gln Glu Gly Lys Glu Tyr Thr Asp Lys Ala His
                                                        95
Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly His Gly
                                                    110
Leu Glu Val Glu Ile Asn Cys Thr Arg Thr Gln Asn Thr Lys Cys Arg
                                                125
Cys Lys Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp
                                            140
Pro Cys Thr Lys Cys Glu His Gly Ile Ile Lys Glu Cys Thr Leu Thr
145                 150                 155                 160
Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp
                                                        175
Leu Cys Leu Leu Leu Leu Pro Ile Pro Leu Ile Val Trp Val Lys Arg
                                                    190
Lys Glu Val Gln Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gln Gly
                                                205
Ser His Glu Ser Pro Thr Leu Asn Pro Glu Thr
                        215

(2) INFORMATION FOR SEQ ID NO:46:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 280 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:
Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu
                                                        15
Glu Leu Leu Val Gly Ile Tyr Pro Ser Gly Val Ile Gly Leu Val Pro
                                                    30
His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gln Gly Lys
                                                45
Tyr Ile His Pro Gln Asn Asn Ser Ile Cys Cys Thr Lys Cys His Lys
                                            60
Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gln Asp Thr Asp
                                                            80
Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu
                                                        95
Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gln Val
                                                    110
Glu Ile Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg
                                                125
Lys Asn Gln Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gln Cys Phe
                                            140
Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gln Glu
145                 150                 155                 160
Lys Gln Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu
                                                        175
Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr
                                                    190
Lys Leu Cys Leu Pro Gln Ile Glu Asn Val Lys Gly Thr Glu Asp Ser
                                                205
Gly Thr Thr Val Leu Leu Pro Leu Val Ile Phe Phe Gly Leu Cys Leu
                                            220
Leu Ser Leu Leu Phe Ile Gly Leu Met Tyr Arg Tyr Gln Arg Trp Lys
225                 230                 235                 240
Ser Lys Leu Tyr Ser Ile Val Cys Gly Lys Ser Thr Pro Glu Lys Glu
                                                        255
Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser
                                                    270
Phe Ser Pro Thr Pro Gly Phe Thr
                            280

(2) INFORMATION FOR SEQ ID NO:47:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 207 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
Met Leu Arg Leu Ile Ala Leu Leu Val Cys Val Val Tyr Val Tyr Gly
                                                        15
Asp Asp Val Pro Tyr Ser Ser Asn Gln Gly Lys Cys Gly Gly His Asp
                                                    30
Tyr Glu Lys Asp Gly Leu Cys Cys Ala Ser Cys His Pro Gly Phe Tyr
                                                45
Ala Ser Arg Leu Cys Gly Pro Gly Ser Asn Thr Val Cys Ser Pro Cys
                                            60
Glu Asp Gly Thr Phe Thr Ala Ser Thr Asn His Ala Pro Ala Cys Val
                                                            80
Ser Cys Arg Gly Pro Cys Thr Gly His Leu Ser Glu Ser Gln Pro Cys
                                                        95
Asp Arg Thr His Asp Arg Val Cys Asn Cys Ser Thr Gly Asn Tyr Cys
                                                    110
Leu Leu Lys Gly Gln Asn Gly Cys Arg Ile Cys Ala Pro Gln Thr Lys
                                                125
Cys Pro Ala Gly Tyr Gly Val Ser Gly His Thr Arg Ala Gly Asp Thr
                                            140
Leu Cys Glu Lys Cys Pro Pro His Thr Tyr Ser Asp Ser Leu Ser Pro
145                 150                 155                 160
Thr Glu Arg Cys Gly Thr Ser Phe Asn Tyr Ile Ser Val Gly Phe Asn
                                                        175
Leu Tyr Pro Val Asn Glu Thr Ser Cys Thr Thr Thr Ala Gly His Asn
                                                    190
Glu Val Ile Lys Thr Lys Glu Phe Thr Val Thr Leu Asn Tyr Thr
                                                205

(2) INFORMATION FOR SEQ ID NO:48:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 227 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:
Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu
                                                        15
Trp Ala Ala Ala His Ala Leu Pro Ala Gln Val Ala Phe Thr Pro Tyr
                                                    30
Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gln
                                                45
Thr Ala Gln Met Cys Cys Ser Lys Cys Ser Pro Gly Gln His Ala Lys
                                            60
Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp
                                                            80
Ser Thr Tyr Thr Gln Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys
                                                        95
Gly Ser Arg Cys Ser Ser Asp Gln Val Glu Thr Gln Ala Cys Thr Arg
                                                    110
Glu Gln Asn Arg Ile Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu
                                                125
Ser Lys Gln Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg
                                            140
Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val
145                 150                 155                 160
Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr
                                                        175
Asp Ile Cys Arg Pro His Gln Ile Cys Asn Val Val Ala Ile Pro Gly
                                                    190
Asn Ala Ser Arg Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser
                                                205
Met Ala Pro Gly Ala Val His Leu Pro Gln Pro Val Ser Thr Arg Ser
                                            220
Gln His Thr
225

(2) INFORMATION FOR SEQ ID NO:49:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 197 amino acids
          (B) TYPE: amino acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:
Met Val Ser Leu Pro Arg Leu Cys Ala Leu Trp Gly Cys Leu Leu Thr
                                                        15
Ala Val His Leu Gly Gln Cys Val Thr Cys Ser Asp Lys Gln Tyr Leu
                                                    30
His Asp Gly Gln Cys Cys Asp Leu Cys Gln Pro Gly Ser Arg Leu Thr
                                                45
Ser His Cys Thr Ala Leu Glu Lys Thr Gln Cys His Pro Cys Asp Ser
                                            60
Gly Glu Phe Ser Ala Gln Trp Asn Arg Glu Ile Arg Cys His Gln His
                                                            80
Arg His Cys Glu Pro Asn Gln Gly Leu Arg Val Lys Lys Glu Gly Thr
                                                        95
Ala Glu Ser Asp Thr Val Cys Thr Cys Lys Glu Gly Gln His Cys Thr
                                                    110
Ser Lys Asp Cys Glu Ala Cys Ala Gln His Thr Pro Cys Ile Pro Gly
                                                125
Phe Gly Val Met Glu Met Ala Thr Glu Thr Thr Asp Thr Val Cys His
                                            140
Pro Cys Pro Val Gly Phe Phe Ser Asn Gln Ser Ser Leu Phe Glu Lys
145                 150                 155                 160
Cys Tyr Pro Trp Thr Ser Cys Glu Asp Lys Asn Leu Glu Val Leu Gln
                                                        175
Lys Gly Thr Ser Gln Thr Asn Val Ile Cys Gly Leu Lys Ser Arg Met
                                                    190
    - Arg Ala Leu Leu Val                                                                  195                                                                    - (2) INFORMATION FOR SEQ ID NO:50:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 224 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:                                 - Met Gly Ala Gly Ala Thr Gly Arg Ala Met As - #p Gly Pro Arg Leu Leu          #                15                                                            - Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gl - #y Ala Lys Glu Ala Cys          #            30                                                                - Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cy - #s Cys Lys Ala Cys Asn          #        45                                                                    - Leu Gly Glu Gly Val Ala Gln Pro Cys Gly Al - #a Asn Gln Thr Val Cys          #    60                                                                        - Glu Pro Cys Leu Asp Ser Val Thr Phe Ser As - #p Val Val Ser Ala Thr          #80                                                                            - Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gl - #y Leu Gln Ser Met Ser          #                95                                                            - Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cy - #s Arg Cys Ala Tyr Gly          #           110                                                                - Tyr Tyr Gln Asp Glu Thr Thr Gly Arg Cys Gl - #u Ala Cys Arg Val Cys          #       125          
                                                          - Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gl - #n Asp Lys Gln Asn Thr          #   140                                                                        - Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Se - #r Asp Glu Ala Asn His          145                 1 - #50                 1 - #55                 1 -        #60                                                                            - Val Asp Pro Cys Leu Pro Cys Thr Val Cys Gl - #u Asp Thr Glu Arg Gln          #               175                                                            - Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Gl - #u Cys Glu Glu Ile Pro          #           190                                                                - Gly Arg Trp Ile Thr Arg Ser Thr Pro Pro Gl - #u Gly Ser Asp Ser Thr          #       205                                                                    - Ala Pro Ser Thr Gln Glu Pro Glu Ala Pro Pr - #o Glu Gln Asp Leu Ile          #   220                                                                        - (2) INFORMATION FOR SEQ ID NO:51:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 205 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:                                 - Met Tyr Val Trp Val Gln Gln Pro Thr Ala Ph - #e Leu Leu Leu Gly Leu          #                15                                                            - Ser Leu Gly Val Thr Val Lys Leu Asn Cys Va - #l Lys Asp Thr Tyr Pro          #            30                               
                                 - Ser Gly His Lys Cys Cys Arg Glu Cys Gln Pr - #o Gly His Gly Met Val          #        45                                                                    - Ser Arg Cys Asp His Thr Arg Asp Thr Val Cy - #s His Pro Cys Glu Pro          #    60                                                                        - Gly Phe Tyr Asn Glu Ala Val Asn Tyr Asp Th - #r Cys Lys Gln Cys Thr          #80                                                                            - Gln Cys Asn His Arg Ser Gly Ser Glu Leu Ly - #s Gln Asn Cys Thr Pro          #                95                                                            - Thr Glu Asp Thr Val Cys Gln Cys Arg Pro Gl - #y Thr Gln Pro Arg Gln          #           110                                                                - Asp Ser Ser His Lys Leu Gly Val Asp Cys Va - #l Pro Cys Pro Pro Gly          #       125                                                                    - His Phe Ser Pro Gly Ser Asn Gln Ala Cys Ly - #s Pro Trp Thr Asn Cys          #   140                                                                        - Thr Leu Ser Gly Lys Gln Ile Arg His Pro Al - #a Ser Asn Ser Leu Asp          145                 1 - #50                 1 - #55                 1 -        #60                                                                            - Thr Val Cys Glu Asp Arg Ser Leu Leu Ala Th - #r Leu Leu Trp Glu Thr          #               175                                                            - Gln Arg Thr Thr Phe Arg Pro Thr Thr Val Pr - #o Ser Thr Thr Val Trp          #           190                                                                - Pro Arg Thr Ser Gln Leu Pro Ser Thr Pro Th - #r Leu Val                      #       205                                                                    - (2) INFORMATION FOR SEQ ID NO:52:                                            -      (i) SEQUENCE CHARACTERISTICS:                                   
        #acids    (A) LENGTH: 191 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:                                 - Met Gly Asn Asn Cys Tyr Asn Val Val Val Il - #e Val Leu Leu Leu Val          #                15                                                            - Gly Cys Glu Lys Val Gly Ala Val Gln Asn Se - #r Cys Asp Asn Cys Gln          #            30                                                                - Pro Gly Thr Phe Cys Arg Lys Tyr Asn Pro Va - #l Cys Lys Ser Cys Pro          #        45                                                                    - Pro Ser Thr Phe Ser Ser Ile Gly Gly Gln Pr - #o Asn Cys Asn Ile Cys          #    60                                                                        - Arg Val Cys Ala Gly Tyr Phe Arg Phe Lys Ly - #s Phe Cys Ser Ser Thr          #80                                                                            - His Asn Ala Glu Cys Glu Cys Ile Glu Gly Ph - #e His Cys Leu Gly Pro          #                95                                                            - Gln Cys Thr Arg Cys Glu Lys Asp Cys Arg Pr - #o Gly Gln Glu Leu Thr          #           110                                                                - Lys Gln Gly Cys Lys Thr Cys Ser Leu Gly Th - #r Phe Asn Asp Gln Asn          #       125                                                                    - Gly Thr Gly Val Cys Arg Pro Trp Thr Asn Cy - #s Ser Leu Asp Gly Arg          #   140                                                                        - Ser Val Leu Lys Thr Gly Thr Thr Glu Lys As - #p Val Val Cys Gly Pro          145              
   1 - #50                 1 - #55                 1 -        #60                                                                            - Pro Val Val Ser Phe Ser Pro Ser Thr Thr Il - #e Ser Val Thr Pro Glu          #               175                                                            - Gly Gly Pro Gly Gly His Ser Leu Gln Val Le - #u Thr Leu Phe Leu              #           190                                                                - (2) INFORMATION FOR SEQ ID NO:53:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 102 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:                                 - Thr Ala Thr Gly Gly Ala Thr Gly Ala Ala Gl - #y Ala Ala Ala Cys Thr          #                15                                                            - Thr Cys Thr Cys Ala Thr Cys Ala Gly Cys Th - #r Gly Cys Thr Gly Thr          #            30                                                                - Gly Thr Gly Ala Thr Ala Ala Ala Thr Gly Th - #r Cys Cys Gly Cys Cys          #        45                                                                    - Gly Gly Gly Thr Ala Cys Ala Cys Cys Thr Al - #a Cys Thr Thr Cys Thr          #    60                                                                        - Thr Thr Gly Ala Ala Gly Ala Gly Thr Ala Gl - #y Thr Cys Gly Ala Cys          #80                                                                            - Gly Ala Cys Ala Cys Ala Cys Thr Ala Thr Th - #r Thr Ala Cys Ala Gly          #                95                       
                                     - Gly Cys Gly Gly Cys Cys                                                                  100                                                                __________________________________________________________________________ 

What is claimed is:
 1. A transgenic non-human mammal having integrated into its genome a nucleic acid sequence encoding osteoprotegerin operatively linked to regulatory elements, wherein expression of said coding sequence increases the level of osteoprotegerin and the bone density of said mammal relative to a non-transgenic mammal of the same species, wherein the coding sequence is selected from the group consisting of: a) a nucleic acid encoding a polypeptide comprising the amino acid sequence from residue 1 to 401 or from residue 22 to 401 of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6; and b) a nucleic acid encoding a polypeptide having the biological activity of inhibiting bone resorption wherein the nucleic acid hybridizes with the nucleic acid in (a) under conditions comprising hybridization at 5× SSC, 50% formamide and 42° C. and washing at 0.5× SSC and 55° C.
 2. The mammal of claim 1 which is a mouse. 